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Sir: 

RESPONSE TO NOTIFICATION OF NON-COMPLIANCE UNDER 37 CFR 

1.192(c) 

In response to the Notification Of Non-Compliance With 37 CFR 1.192(c) dated 
May 1 0, 2004 ("Notification"), Appellants hereby submit a revised Brief on Appeal. 
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As an initial matter, the undersigned expresses thanks to Dr. Schwadron for 
discussing the Notification in a telephone conference (July 29, 2004) including its 
statement that Exhibit A of the Appeal Brief submitted August 23, 2003 contained "new 
evidence" as defined in MPEP 1207. 

Appellant respectfully disagrees that Exhibit A is "new evidence" under the cited 
MPEP section. For instance, MPEP 1207 is entitled "Amendment Filed With or After 
Appeal". Exhibit A is not an amendment. Instead, it is merely a drawing that shows for 
the sake of convenience to the Board, a TCR heterodimer and single-chain TCR. It is not 
seen how MPEP 1207 (relating to amendments) supports the Office's position that a 
drawing provided for convenience is "new evidence". 

However, to speed consideration of the Appeal Brief on the merits, Appellant 
agreed during the conference to withdraw Exhibit A. The enclosed Appeal Brief has 
been modified along this line. 

Appellant respectfully points out that Exhibit A has been part of Appellants' 
appeal for more than a year. During this time, Appellants received two notifications of 
alleged non-compliance and in no case did those communications refer to Exhibit A. 
Appellants would have preferred to address any "new evidence" issues earlier with the 
USPTO. However, the Examiner did mention during the conference that if Exhibit A is 
removed, the Appeal Brief would be in full compliance with the requirements of 37 CFR 
1.1 92( c). Appellants thank the Examiner for this consideration. 

To avoid all doubt, the undersigned contacted SPE Christina Chan to discuss the 
Notification. Appellant was assured that all outstanding compliance issues would be 
raised during the telephone conference with Dr. Schwadron. 

Accordingly, it is believed that the revised Brief (now without Exhibit A) is in full 
compliance with the rules. Consideration on the merits is now requested. However, if 



J. Weidanz, et al. 
USSN 08/813,781 
Page - 3 - 

the Office believes the Appeal Brief is still non-compliant, the undersigned would most 
appreciate a telephone call to discuss further issues. 

Included with the present submission is a Petition for Extension of Time for two 
months, ie. from June 10, 2004 to August 10, 2004. Also enclosed is the requisite 
petition fee 

Although it is not believed that any fee is needed to consider this submission, the 
Commissioner is hereby authorized to charge any fees which may be required to Deposit 
Account No. 04-1105. 



Respectfully submitted, 





Robert L. Buchanan (Reg. No. 40,927) 
EDWARDS & ANGELL, LLP 
P. O. Box 55874 
Boston, MA 02205 
Tel. (617) 439-4444 
Fax (617) 439-4170/7748 
Customer No.: 21874 
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REAL PARTY IN INTEREST 
The real party in interest is Altor Bioscience Corporation of Miramar, Florida. An 
assignment from the inventors to Dade International was recorded on August 18, 1997 at Reel/ 
Frame 8681/0081. An assignment from Dade International to Sunol Molecular Corporation was 
recorded on February 26, 2003 at Reel/Frame 013787/0276. An assignment from Sunol 
Molecular Corporation to Altor Bioscience Corporation was mailed to the USPTO on February 
3, 2004. 

RELATED APPEALS AND INTERFERENCES 
There are no related appeals or interferences known to Appellants or Appellants' 
representatives that will directly affect or be directly affected by or have a bearing on the Board's 
decision in the pending Appeal. 

STATUS OF THE CLAIMS 
Claims 1, 2, 4, 7, 8, 14, 67, 69, 71 and 72 stand finally rejected under 35 U.S.C. §103 and 
are pending on appeal. Claims 5, 6, 10-12, 16-20, 61, 65, 70, 73 and 74 have been cancelled. 
Claims 3, 9, 13, 15, 21-60, 62-64, 66 and 68 were withdrawn from consideration by the 
Examiner. 

STATUS OF THE AMENDMENTS 
Claims 1, 2, 4, 7, 8, 14, 67, 69, 71 and 72 on appeal are set forth in Exhibit A hereto. 
There has been no further amendment to the claims. 

SUMMARY OF THE CLAIMED INVENTION 
The claimed invention features a soluble fusion protein engineered to include a 
bacteriophage coat protein fused to a single-chain T cell receptor ("scTCR"). The single-chain T 
cell receptor was itself designed to include an alpha- variable region (" V-a") fused to a beta- 
variable region ("V-p"). The single-chain T cell receptor forms a pocket that binds antigen when 
the antigen. The claimed soluble fusion protein further includes a beta-constant region ("C-p") 
region that can be fused to V-P, for example. 
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T cells help defend the body against infection. The cells have membrane bound receptors 
that bind foreign antigen with the assistance of a protein complex called M MHC". A key receptor 
is called the T cell receptor ("TCR"). The chemical structure and function of the TCR has been 
extensively studied. For instance, it is known that formation of a TCR-antigen-MHC complex is 
an important step toward fighting infection. 

Appellants discovered that by adding a bacteriophage coat protein to the scTCR, it is 
possible to produce a fully soluble and functional scTCR. Unlike prior scTCRs, the claimed 
fusion proteins were found to be fully soluble, functional, and obtainable in significant quantities 
without difficulty. The claimed fusion proteins have a wide spectrum of important uses as 
described throughout the instant patent application. 

See the Summary Of The Invention at pg. 3, line 22 to pg. 11, line 21. See also the 
Background at pg. 1, line 12 to pg. 3, line 19 for related information. None of the art of record in 
this case shows an attempt to make a scTCR that includes a fused bacteriophage coat protein. 

ISSUE 

This appeal presents the issue of whether the Examiner erred in rejecting claims 1, 2, 4, 7, 
8, 14, 67, 69, 71 and 72 under 35 U.S.C. § 103 in view of Chung, S. et al. (1994) Proc. Natl 
Acad. Sci. (USA) 91: 12654 in view of U.S Pat. No. 5,759,817 to Barbas, Onda, T et al. (1996) 
Mol Immunol 32: 1387; and Huse et al. (1992) J. Immunol 149: 3914. Appellants will refer to 
these citations as "Chung", "Barbas", "Onda" and "Huse", respectively; unless stated otherwise. 
There are no other pending rejections of record in this case. 

GROUPING OF THE CLAIMS 
All of claims 1, 2, 4, 7, 8, 14, 67, 69, 71 and 72 stand or fall together for the purpose of 
the present appeal. 

CASE HISTORY 
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In consideration of the Examiner's position in this case, Appellants have summarized the 
prosecution history with respect to the 35 U.S.C. §103 rejection at issue. A more detailed 
discussion of Appellants 1 rebuttal to those arguments will follow under Argument . 



A. A patent application was filed on March 7, 1997 with 59 claims and assigned 
Serial No.: 08/813,781 by the USPTO. 



B. An Office Action was mailed to Appellants on August 25, 2000 by Examiner 
Schwadron. The Examiner stated the following with respect to the §103 rejection on appeal: 

Claims 1, 2, 4, 7, 8, 14, 67, 69, 71, 72 are rejected under 35 U.S.C. 103(a) 
as unpatentable over Chung et al. in view of Barbas US 5,759,817 (filed Jan. 27, 
1992), Onda et al. (Molecular Immunology 32:1387, 1995), and Huse et al. J. 
Immunology 149:3914. 1992 

Chung et al. teaches a single chain T cell receptor which specifically binds 
to peptide ligand (see abstract). Chung et al. further teaches one embodiment of 
human single chain TCR in which C-terminus of V a domain is linked to N- 
terminus of V P chain via a 15 amino acid residue flexible amino acid liner and 
the C-terminus of the V p chain is linked to the beta chain constant domain (see 
Figure 1). In one embodiment the C terminus of V p chain is linked to a alkaline 
phosphatase (PI) protein tag (see page 12655). Chung et also teach that the 
purpose of the linker is to enhance the binding characteristics of the soluble T cell 
receptor and that linkers of about 10 to 30 amino acid residues would be 
considered to be sufficient. Chung et al. teach that the TCR fusion protein can 
bind antigenic protein, thus teaching that the TCR fusion protein comprises an 
antigen binding pocket. Chung et al. teaches a TCR fusion protein comprising V- 
a-peptide linker-V p-C p linked to GPI anchor and expression of such a fusion 
protein in a transfected eukaryotic cell (see results section). Chung et al. disclose 
that the soluble form of TCR protein could be readily obtained by enzymatic 
cleavage with phosphatidylinostol-specific phospholipase C (PI-PLC) (see page 
12656). Chung et al. teaches expression of said TCR fusion protein in a bacterial 
cell system in which the N terminus of the Cp region is linked to a histidine 
protein tag. Chung et al. also disclose a scTCR in which comprises V-a-peptide 
linker-V p-C p GPI in which the Cp component consists of the p chain sequence 
ending right before the last cysteine (the sixth cysteine) (see page 12655). Chung 
et al. further teach that TCR fusion proteins which do not contain the CB do not 
fold into the native conformation. The scTCR disclosed by Chung et al. meet the 
length limitations of the Va. and Vp region recited in claims 69 and 71 . Chung et 
al. teach a soluble fusion protein comprising a Va-peptide linker-Vp-Cp 
fragment-protein tag (eg. GPI). Chung et al. does mot teach a TCR fusion 
protein further comprising bacteriophage VIII coat protein. 
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However, Barbas discloses a soluble fusion protein comprising a 
bacteriophage coat protein fragment covalently linked to a single-chain 
heterodimeric receptor (see abstract and column 15, lines 27-28, in particular). 
Barbas also discloses that the fusion protein may comprise domains of 
heterodimeric proteins derived from several ligand binding proteins, including 
immunoglobulins and T cell receptors (see column 17, lines 62-66 and column 19, 
lines, 9-28. Barbas discloses that T cell receptor comprises alpha and beta chains 
each having a variable(V) and constant(C) region and T cell receptor has 
similarities in genetic organization and function to immunoglobulins (see column 
19, lines 19-22, in particular). Barbas also teaches that bacteriophage coat protein 
may be derived from cpIII or cpVIII (see column 31, lines 10-28, in particular). 
Barbas discloses that expression vectors expressing soluble fusion proteins in 
which the ligand binding region is fused to bacteria coat protein allows the 
expression of the multiple fusion proteins on the surface of phage particles IE 
approximately 2700 cpVIII heterodimer receptor molecules per phage particle 
(see column 39 line 64 through column 40, line 7, in particular). Barbas further 
discloses that a short length of amino acid sequence at the amino end of a protein 
(IE a protein tag) directs the protein to periplasmic space (see column 8, lines 49- 
55, in particular. One embodiment of the invention is disclosed to be a fusion 
protein comprising in sequence a leader sequence-peptide linker-V region amino 
acid residue-peptide linker-phage coat protein and that in one embodiment, the 
second linker can define a proteolytic cleavage site which allows the 
heterodimeric receptor to be cleaved from the bacteriophage coat protein to which 
it is attached (see column 14, lines 60-65). Thins Barbas discloses but does not 
exemplify a soluble fusion protein comprising a bacteriophage coat protein 
covalently linked to T cell receptor domains. 

Onda et al. disclose a soluble fusion protein comprising a bacteriophage 
coat protein covalently linked to a single-chain T cell receptor by a peptide linker 
sequence wherein the single TCR chain is the alpha chain and the bacteriophage 
coat protein is cpVIII (see abstract and Figure 1, in particular). Onda et al. also 
teach that TCR-bacteriophage coat protein fusion protein can be used to study 
specific binding interactions of the TCR chain to antigenic ligands (see paragraph 
bridging pages 1394-1395, in particular). 

Huse et al. teach that fusion proteins comprising a single chain fusion 
protein comprising Fab fragment of immunoglobulin (which comprises the 
antigen binding pocket of the immunoglobulin molecule) and bacteriophage VIII 
coat protein can be produced and display the fusion protein when expressed in a 
Ml 3 derived vector. Huse et al. further teach that bacteriophage VIII coat protein 



Examiner Schwadron withdrew this statement in the next Office Action dated June 17, 2002 (see below). The 
statement was originally made by a prior Examiner (Lubet) in a related §103 rejection that has been withdrawn. In 
that earlier rejection, Lubet argued that Barbas and Onda do not teach a soluble fusion protein in which a single- 
chain TCR linked to a bacteriophage coat protein. See the Office Action dated June 23, 1998 at pg. 8, part B. In the 
§103 rejection on appeal, Examiner Schwadron argued that Barbas and Onda teach use of TCR -bacteriophage VIII 
fusion protein. That molecule, a heterodimeric T cell receptor fusion protein, is not the claimed invention. 
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fusion protein can recovered from culture medium or from the periplasmic space 
(see abstract). 

Therefore it would have been prima facie obvious to one with ordinary 
skill in the art at the time the invention was made to make a soluble TCR fusion 
protein comprising the Vol. -pep tide linker-VP -Cb fragment-protein taught by 
Chung et al. linked to a bacteriophage VIII coat protein because Barbas et al. 
and Oada et al. teach TCR-bacteriophage VIII coat fusion* proteins can be 
used to stindy antigen binding properties of such a fusion protein and Huse et 
al. teach that fusion proteins comprising bacteriophage VIII coat protein can be 
produced in bacteria and recovered in relatively large quantities. 

One with skill in the art would be motivated to make such a fusion protein 
to study the antigen binding region of the TCR component or to use the protein to 
elicit anti-idiotypic antibodies. One with skill in the art would be motivated to 
make such a fusion protein in which the Va and VP region was derived from 
human TCR in order to study human TCR properties or to elicit anti-idiotypic 
antibodies to the TCR component of the protein. 

The preceding grounds of rejection have been maintained since the August 25, 2000 
Office Action despite Appellants' rebuttal argument and claim amendments discussed below. 



C. On October 25, 2000, Appellants 1 representative met with Examiner Schwadron at 
the USPTO and discussed the art cited. No agreement was reached. 



D. Appellants submitted a response to the rejection set forth in paragraph B, above, on 
February 22, 2001 . In that response, Appellants rebutted the prima facie obviousness argument 
by pointing out: 1) that the cited references did not teach or suggest that the membrane "anchor" 
of Chung's single-chain TCR (GPI: a membrane protein) could be substituted with the 
bacteriophage coat protein of Barbas 1 TCR; 2) that there was no reasonable expectation that the 
substitution (switching Chung's anchor for Barbas 1 phage coat protein) could be achieved in view 
of substantial differences between scTCRs and TCRs; and 3) that the Examiner's citation of 
Onda was not correct ie., it does not disclose TCR-bacteriophage coat protein fusions, but 
instead, dwelt on smaller constructs having only a V-a chain (but no V-J3 chain). Onda 
characterized his constructs as having "unusual" binding properties that were not characteristic of 
TCRs. A subset of such constructs were reported by Onda not to work at all. 
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Appellants also discussed the Holler reference: a peer-reviewed scientific article from 
the U.S. Academy of Sciences (PNAS (USA) (2000) 97: 5387 at 5389). Holler provided 
independent and objective evidence of the long-felt need and failure of others in the field to 
make and use the claimed fusion molecules. Specifically, Holler stated that phage display had 
not yet proven successful in the engineering of scTCRs. 2 

E. In response to the Appellants' arguments in paragraph D above, the Examiner 
issued a Final Office Action dated June 17, 2002. 3 The Examiner maintained the prima facie 
rejection and stated: 



Regarding appellants comments, while heterodimeric molecules are a 
preferred embodiment disclosed in Barbas et al., Barbas et al. disclose: 
"In another embodiment, the present invention contemplates a polypeptide 
comprising an insert domain flanked by an amini-terminal secretion signal 
domain and a carboxy-terminal filamentous phage coat protein membrane anchor 
domain." (column 14, first complete paragraph). 

Barbas et al. further disclose than said construct could include a "receptor 
protein" (column 14, second paragraph), indicating that the disclosed method 
could be used for receptors per se (eg. single chain or heterodimeric or single 
chain heteromers). Single chain T cell receptors were known in the art (see Chung 
et al.). 

Regarding appellants comments about the single chain TCR taught by 
Chung et al., 

Chung et al. teach that the GPI anchor is cleaved and the soluble TCR still 
has all the antigen binding properties of the TCR (see pages 12656-12658). Thus, 
the GPI anchor is not required for the soluble TCR to function, it is just used in 
one particular method of making the soluble TCR. Regarding motivation to create 
the claimed invention, Chung et al. discloses that it would be desirable to 
produce ttoeir TCR inn a phage display system (see page 12658, first cokmit). 
In addition, Barbas et al. teach the advantages of their system for the production 



By "phage display" is meant the process of making a recombinant bacteriophage expressing the scTCR as part of 
the phage protein coat. After infecting bacteria with a recombinant phage engineered to produce the scTCR, the 
protein would be "displayed" on the bacterial cell surface as bacteriophage. The scTCR "displayed" in this manner 
would be amendable to engineering. 

3 The Examiner essentially repeated his rejection of the claims as set forth in the August 25, 2000 rejection. But see 
footnote 1 . With respect to Examiner Schwadron's discussion about Holler, Appellants submitted Weidanz et al. as 
part of a Rule 132 Declaration to address an obviousness rejection that has since been withdrawn. The reference 
provided evidence that a particular scTCR-bacteriophage coat protein vector (pKC44) was capable of forming an 
antigen binding site when expressed. The reference is not prior art to the present application. 
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of peptides. Regarding reasonable expectation of success, both Barbas et al. and 
Chung et al. disclose use of phage display systems to produce single chain 
antibodies (see column 2, third paragraph from bottom and page 12658, first 
column). In addition, the soluble single chain TCR molecules functions with or 
without the GPI linker indicating that the construct itself is functional. 

Regarding appellants comments about Holler et al., said publication was 
published in May 2000. In the amendment filed 6/3/2000, applicant submitted a 
publication by Weidanz et al. (J. Imm. Methods 1998) which discloses the 
claimed invention. Thus, it appears that Holler et al. simply are mot familiar 
with the prior art. Thus, the comments of Holler et al. carry bo weight 
because two years prior to the Holler et al. publication, Weidamz et al. had 
already published data regarding the production of single chain TCR using 
bacteriophage. Furthermore, Holler et al. discloses a yeast system for producing 
a single chain TCR and it appears that the main focus of Holler et al. is to promote 
their system. 

Regarding appellants comments about Onda et al., the instant rejection 
indicates that "Onda et al. disclose a soluble fusion protein comprising a 
bacteriophage coat protein covalently linked to a single-chain T cell receptor 
by a peptide linker sequence wherein the single TCR chain is the alpha chain and 
the bacteriophage coat protein is cpVIII (see abstract and Figure 1, in particular)". 
The art recognizes that the alpha and beta chains of the TCR generally both are 
involved in antigen binding. The art also recognizes that soluble TCR which bind 
antigen would have a variety of uses. 

F. Appellants filed a Notice of Appeal on October 17, 2002. 

ARGUMENTS 

As an initial mater, Appellants wish to emphasize the substantial differences between 
TCR heterodimers ("TCRs") and single-claim T cell receptors ("sc-TCRs"). 

The TCR is a heterodimer with one a chain and one p chain. 4 Each of these chains 
passes from the exterior of the T cell, through the cell membrane, and into the cell interior 
(cytosol). The a and (3 chains each include a variable (V-a, V-p) region that cooperate to form an 
antigen binding pocket. The regions are "variable" because its chemical structure can be 



A textbook in the field describes the TCR as "a heterodimer composed of an a and a P polypeptide chain, both of 
which are glycosylated." See Alberts, B et al. ( 1 989) in Molecular Biology of the Cell, 2 nd Ed. Garland Publishing, 
Inc. New York at pg. 1037. By convention, a "heterodimer" such as the TCR properly has two chains (dimer) both 
of which are different (hetero) from the other. 
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changed to make a pocket that fits another antigen. Each of the V-a and V-p regions are 
associated with a constant (C) region. 

TCR heterodimers have been extremely difficult to isolate from T cells. This problem 
has hindered study of the receptor. One approach to address the problem has been to make 
single-chain T cell receptors ("scTCRs"). These synthetic receptors include, on one chain 
instead of two, a fused V-a and V-p region. It has been customary to space the V-a and V-p 
regions from each other with a flexible linker to allow the regions to make an antigen binding 
pocket. Unfortunately, many scTCRs have still proven to be difficult to make and use. 

Appellants point out that "TCR" is understood in the field to mean a heterodimeric T 
cell receptor. The TCR is a membrane bound (insoluble) receptor in which the a and p chains 
cooperate to bind antigen. Reference to a "scTCR" is understood to mean a synthetic single- 
chain molecule that includes the V-a and V-p regions bound together usually through a flexible 
linker. Unlike the TCR, the scTCR binds antigen with only one chain. The TCR and scTCR are 
structurally distinct proteins that are different molecules that bind antigen in different ways. 

I. Summary of the Cited Art 

A. Chung reports functional three-domain single-chain T cell receptors consisting of a 
human Va and VP region that recognizes a particular antigen (HLA-DR2b/myelin basic protein). 
Chung determined that it was important to fuse a Cp region to the Vp region. Such a three 
domain construct, when linked to a synthetic cell membrane anchor (glycosyl phosphatidyl- 
inositol (GPI) or CD3£ fragment), was found to be expressed and functional. Chung disclosed 
that the cell membrane anchor could be cleaved from the single-chain receptors to obtain soluble 
protein. See the Abstract. 

Chung opined that his single-chain design "may allow 11 construction of TCR phage 
libraries and that such libraries "may be" tools for studying TCRs. See pg. 12658. However, 
there is no specific disclosure in Chung about how such libraries could be made or, if made, 
whether his single-chain TCRs could tolerate fusion of the bacteriophage coat protein. Chung 
does not report or suggest that the GPI or CD3^ membrane anchor could be substituted with a 
bacteriophage coat protein. Even if there was such a teaching, there is no disclosure in the 
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reference about whether a recombinant bacteriophage could tolerate Chung's scTCR as part of 
the phage coat. 

B. Barbas discloses heterodimeric receptor libraries that use phagmids. In the Abstract, 
phage are taught to encapsulate a genome encoding first and second polypeptides of a receptor 
such as an antibody; in which the first and second polypeptides are integrated into the coat 
matrix of the phage. Barbas generally discloses that such phage may include a polypeptide with 
an "insert domain" that has a receptor domain flanked by a secretion signal domain and a phage 
coat protein membrane anchor domain. Col. 14, lines 10-14. Heterodimeric receptors are 
preferred. See Col. 3, lines 1-41; Col. 14, lines 15-29; and Col. 15, lines 28-32. According to 
Barbas, there was some uncertainty in the field about which portions of bacteriophage coat 
proteins were needed for phage assembly. Col. 2, lines 19-46. 

Barbas does not teach how to make or use a scTCR with or without a fused bacteriophage 
coat protein. 

C. Onda reports use of a phage display system to explore binding interactions between 
the V-a region and antigen. Onda did not disclose use of the system to study TCR or single- 
chain TCR interactions. In the Abstract, Onda provides at pg. 1387: 

We utilized an Ml 3 phage display system, designed for multivalent 
receptor display, to explore specific binding interactions between various TCR a 
chains and specific antigen in the absence of MHC. 

That is, Onda fused only the V-a region to bacteriophage coat protein. The constructs do not 
include a V-p region and are not scTCR fusion proteins. Onda's fusions are much smaller TCR "half- 
molecules" lacking the V-P region and antigen binding pocket of Appellants 1 scTCR. 

Onda at pg. 1395, col. 1, cautioned that his constructs were mmuiSMal and not typical of 

standard TCR interactions: 

Our results extend these findings by demonstrating that the dominant interactions 
of certain TCRct chains for peptide antigens may be sufficiently high that they can be 
analysed independently. However, these interactions are quite unusual in that they do 
not require the expression of the second TCR subunit or normal MHC and coreceptor 
interactions. These results may raise concern that this model does not reflect typical 
TCR-ligand interactions. 
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Significantly, only some of Onda's Va chain fusion proteins were reported to bind 

antigen when fused to bacteriophage coat protein. At pg. 1395, col. 2 he states that: 

...only a subset of TCR Va have capacity for direct interactions with antigen 
strong enough to be detectable in this system. 

Onda does not teach or suggest fusing a scTCR (V-a and V-0) to a bacteriophage fusion 

protein. 

D. Huse described a phage vector system for screening and producing antibody F(ab) 

fragments. 5 Huse ! s system was reportedly used to produce free F(ab) and F(ab) displayed on the 

surface of bacteriophage. According to Huse however, not all attempts to produce F(ab) were 

successful. In more than a few instances, the recombinant bacteriophage made to produce the 

fusions apparently would not tolerate certain amounts of antibody protein. In describing 

attempts to display certain antibody H and L chains with his phage vector system, Huse stated on 

pg. 3919, col. 2 that: 

Phage titers of [phage vector] infected cultures were found to decrease 
relative to the level of F(ab)-pVIII fusion protein incorporation (cite omitted). 
Taken together, these results suggest that a functionally viable phage particle 
may be able to tolerate a limited Number off incorporated F(ab)-pVIII fusion 
products and that the amount off F(ab) incorporated into the phage coat may 
adversely affect phage titers and overall F(ab) yield. 

Huse does not disclose fusing a bacteriophage coat protein to a scTCR or TCR. 

II. Summary of the Examiner's Argument 

Grounds for the present rejection under 35U.S.C§103 were formulated in the Office 
Action dated August 25, 2000. See paragraph B, above. The basis for the rejection has not 
changed substantially in the face of Appellants' arguments and claim amendments. 

According to Appellants' understanding of the Examiner's alleged prima facie case, the 
primary references, Chung and Barbas, are alleged to teach a scTCR linked to a bacteriophage 
fusion protein. Onda and Barbas are relied on to teach that TCR-bacteriophage fusion proteins 
can be used to study antigen binding. Huse is used to teach that fusion proteins with the coat 



F(ab) is an abbreviation for an antigen binding fragment of an antibody ( fragment antigen binding). F(ab) is a 
heterodimer consisting of two different chains ie., the antibody light and heavy chain. F(ab) is readily made by 
cleaving whole antibodies with specific proteolytic enzymes. 
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protein can be made in bacteria. The foundation of the Examiner's position is that because 
Barbas, Onda, and Huse teach some bacteriophage coat protein fusions, then it would be obvious 
to make fusions with Chung's scTCRs. Although facially somewhat plausible, the rejection is 
flawed on both scientific and legal principles as discussed herein. 

III. The Examiner Erred in Rejecting Claims h 2. 4. 7, 8, 14, 67, 69, 71 and 72 as Being 
Obvious 

A, Reqmiremeinits of the prima fade case and its maiEteEamce. 

The Examiner erred in maintaining the obviousness rejection in the face of claim 

amendments and the state of the art as submitted made in this case and its parent. The Federal 

Circuit has reiterated that an Examiner's prima facie case is but a procedural tool of patent 

examination, with the express purpose of allocating the burdens of going forward as between the 

Examiner and Applicant. See In re Deckler 977 F.2d at 1449, citations omitted): 

Specifically, when obviousness is at issue, the examiner has the burden of persuasion 
and therefore the initial burden of production. Satisfying the burden of persuasion, 
constitutes a so-called prima facie showing. Once that burden is met, the applicant has 
the burden of production to demonstrate that the examiner's preliminary determination is 
not correct. The examiner, and if later involved, the Board, retain the ultimate burden of 
persuasion on this issue. 

Clearly, as demonstrated herein, adequate evidence of the unobviousness of the claimed 
invention was provided by Appellants to shift the burden of persuasion to the Examiner. 

In view thereof, it is requested that the Board review the obviousness question based on 
the invention as claimed, and the cited references, including all relevant parts thereof. 

B. Standard For Reviewing Am Obviomsniess Rejection minder 35 USC §103. 

The Federal Circuit has reiterated the manner in which obviousness rejections are to be 
reviewed. Where claimed subject matter has been rejected as obvious in view of a combination 
of prior art references, "a proper analysis under section 103 requires, inter alia, consideration of 
two factors: (1) whether the prior art would have suggested to those of ordinary skill in the art 
that they should make the claimed composition or device, or carry out the claimed process; and 
(2) whether the prior art would also have revealed that in so making or carrying out, those of 
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ordinary skill would have a reasonable expectation of success." In re Vaeck , 947 F.2d 488, 493, 
20 U.S.P.Q.2d 1438, 1442 (Fed. Cir. 1991), cited In re Dow Chemical Co ., 837 F.2d 469, 473, 5 
U.S.P.Q.2d 1529, 1531 (Fed. Cir. 1988). As the Federal Circuit emphasized by succinctly 
summarizing: "Both the suggestion and the reasonable expectation of success must be founded in 
the prior art, not the Applicants 1 disclosure." Id. See also In re Merck & Co., Inc., 800 F.2d 1091, 
231 USPQ 375 (Fed. Cir. 1986). 

More recently, the Federal Circuit has reviewed the case law regarding 35 U.S.C. §103. 
See InreSang-Su Lee 277 F.3d 1388, 61 U.S.P.Q.2d 1430 (Fed. Cir. 2002). 

Should the Board adopt the Examiner's prima facie case, Appellants submit that the 
claimed invention would not have been obvious in view of the legal standard summarized above. 

C. No Prima Facie Case off Ofovioinsmess 

As noted above, the foundation of the Examiner' argument rests on the belief that it 
would be obvious to fuse Chung's scTCR to the bacteriophage coat protein of Barbas because, 
allegedly, Barbas and Onda teach TCR-bacteriophage coat fusion proteins and Huse discloses 
that fusion protein with such a coat can be made in bacteria. 

For the Examiner's prima facie case to stand, it is imperative that he establish that: 1) 
The cited references disclose or suggest fusing a bacteriophage coat protein to Chung's scTCR; 
2) there is a settled role for the bacteriophage coat protein in making fusion proteins; and that 3) 
one could fuse Chung's scTCR to Barbas' coat protein with a reasonable expectation of success. 

The Examiner's position is not supported by any of these points. Barbas, as relied on, 
does not teach or suggest a scTCR or even fusion of a scTCR to a bacteriophage coat protein. 
Onda and Huse, when read in their entirety, exemplify uncertainty in the field about using the 
bacteriophage coat protein to make certain fusion proteins. Even Barbas admitted that there was 
some doubt about how much one could change certain bacteriophage coat proteins without 
hindering phage assembly. Moreover, some of Onda's and Huse' molecules did not work well. 
Others did not work at all. On top of that uncertainty is heaped additional doubt about whether 
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Chung's "anchor" fragments could be substituted with the bacteriophage coat protein of Barbas. 
There was also doubt about whether the bacteriophage would tolerate fusion of the scTCR to its 
coat. 

i) Barbas and Onda does not teach or suggest a single-chain TCR (scTCR) 
In the Office Action dated August 25, 2000, Examiner Schwadron took the position that 
Barbas discloses: 

soluble fusion protein comprising a bacteriophage coat protein fragment covalently 

linked to a simtgle-clhaiini heterodnmeric receptor (see the abstract and column 15, lines 

27-28, in particular). Barbas also discloses that the fusion* protein may comprise 

domains of hef erodimeric proteins derived from several ligand binding proteins, 

including immunoglobulins and T cell receptors (see column 1 7, lines 62-66 and column 

19, lines, 9-28. Barbas discloses that T cell receptor comprises alpha and beta chains each 

having a variable(V) and constant(C) region and T cell receptor has similarities in genetic 

organization and function to immunoglobulins (see column 19, lines 19-22, in particular). 
* * * * 

Thus Barbas discloses but does not exemplify a soluble fusion protein comprising a 
bacteriophage coat protein covalently linked to T cell receptor domains 

The heterodimeric receptor proteins pointed out by the Examiner are not scTCRs. 
Heterodimeric proteins, and particularly the TCR of Barbas, are understood in the field to consist 
of two different a and p chains. Brief at pg. 1. Unlike the TCR, the scTCR of Appellants 1 
claimed invention is a single-chain molecule with a V-a chain fused to a V-P chain. The position 
that Barbas discloses a "single-chain heterodimeric" receptor simply makes no sense. How can a 
single-chain molecule be a "heterodimer" when that requires two (dimer) different (hetero) 
chains? Brief at pg. 1 and footnote 1. Barbas could not have had the single-chain constructs of 
Onda and Chung in mind. Those references were published well after the priority date of the 
Barbas patent. Thus, nowhere in the reference is there any disclosure about how to make or use 
a scTCR. 

Faced with this rebuttal, the Examiner took the position in the Final Office that Barbas 1 
disclosure of "polypeptides comprising an insert domain" and "receptor proteins" should be read 
to include Chung's scTCRs. Also included in that sweeping reading of Barbas are "single chain 
or heterodimeric or single chain heteromers". That position is without merit. Too much is read 
from Barbas. It does not provide for any scTCR molecules. If the Examiner's overly-broad view 
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of the patent is allowed to stand and sweep in scTCRs, even though Chung's were published well 
after Barbas ! priority date, it will preempt any attempt to obtain patent protection for scTCR- 
bacteriophage coat fusion proteins. A principle focus of Barbas was to provide heterodimeric 
receptors linked to a phage coat protein. See the Title of the patent, the Abstract and col. 3, lines 
1-41, for instance. Such receptors are not the fusion proteins Appellants claim and there is no 
suggestion in Barbas to make or use them. 

Even assuming, arguendo, that the Examiner is correct and that Barbas taught or 
suggested a scTCR (years before Chung or Onda were published), one reading Barbas in that 
way would be confused in light of the accepted understanding in the field that a heterodimer such 
as the TCR is a complex of two different polypeptide chains. Brief at pp. 1 - 2 and footnote 1. 

As captioned above, Onda does not disclose a TCR or scTCR fusion to bacteriophage 
coat protein as alleged by the Examiner in the August 25, 2000 and June 17, 2002 Office 
Actions. Instead, Onda reports fusion of TCR a cfaainns to bacteriophage coat protein. The TCR 
a chain is merely a part of the larger scTCR Appellants worked with. That is, the prior 
constructs are significantly smaller (and less likely to cause solubility problems when fused to 
coat proteins) than the scTCR fusions Appellants successfully made. 

Moreover, the Examiner ignored Onda's clear hesitation about reading too much from 
TCR a chain constructs that include a fused bacteriophage coat protein. According to Onda, the 
interactions of the constructs were MmusEal and not typical off TCR-ligaiid imteractioES. See 
above and Onda at pg. 1395, col. 1. 

Importantly, only some of Onda's TCR a chain constructs even worked to bind antigen. 
See above and pg. 1395 of Onda at col. 2, second full paragraph. 

According to Onda then, some TCR a chain-bacteriophage coat protein fusions work and 
some do not. Those that do work were viewed as "unusual" and "not typical". In view of this 
caution, one working in this field would not be encouraged to fuse a bacteriophage coat protein 
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to a scTCR. None of the other cited references shed any light on Onda's clear hesitation to 
extend there findings to other TCR molecules. 



The Examiner thus erred in trying to formulate a prima facie case by not giving due 

weight to all relevant portions of Onda. Contrary to this practice, it is well established that the 

Examiner must consider all relevant portions of cited references, including those portions which 

substantially weaken her position. In particular, the former CCPA stated in In re Mercier 515 

R2d 1161, 185 USPQ at 778: 

The relevant portions of a reference include not only those teachings which would 
suggest particular aspects of an invention to one having ordinary skill in the art, 
but also those teachings which would lead such a person away from the claimed 
invention. 

See also Phillips Petroleum Co. v. U.S. Steel Corp .. 673 F.Supp. 1278, 1315, 6 USPQ2d 
1065, 1093 (D.Del. 1987), affd, 865 F.2d 1247, 9 USPQ2d 1461 (Fed. Cir. 1989). 

The Board is thus urged to take Onda in its entirety and to consider all relevant portions 
of it including the passages quoted above. Read in this way, as it should, the reference would 
lead one in this field to doubt whether it would be feasible to fuse a bacteriophage coat protein to 
a scTCR to produce a soluble and functional fusion protein. 

ii) Huse reported difficulties producing some bacteriophage coat protein fusions 

The Huse reference, as quoted above, reported that not all F(ab)-pVIII (bacteriophage) 
coat proteins could be made at high titre. That is, Huse stated that the bacteriophage may not 
tolerate some amounts of F(ab) constructs, thereby decreasing phage titres and overall F(ab) 
yield. See above and Huse at pg. 3919, col. 2. When Huse is read in its entirety, as it should, 
the Examiner's statement that "Huse et al. teach that fusion proteins comprising bacteriophage 
VIII coat protein can be produced in bacteria" is an unsupported generalization. Huse clearly 
found that some amounts of heterodimeric F(ab) constructs harmed the bacteriophage that 
carried them. In view of this warning, a worker in the field would have good reason to doubt 
whether a bacteriophage could be fused to a scTCR or even a heterodimer such as a TCR without 
considerable experimentation. 
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The Examiner took the position in the Final Office Action that Barbas and Chung provide 

a reasonable expectation that one could make the claimed fusion proteins: 

Regarding reasonable expectation* of success, both Barbas et al. and Chung et 
al. disclose use of phage display systems to produce single chain antibodies (see 
column 2, third paragraph from bottom and page 12658, first column). 

However as clearly illustrated by Huse, not all phage display systems using 
antibodies work as expected. Some amounts of heterodimeric F(ab) antibodies cause 
problems. Thus the Examinees position is not supported by the art of record in this case. 

The Board is thus requested to take Huse in its entirety and to consider all relevant 
portions of it including the passage quoted above. In re Mercier 515 F.2d 1 161, 185 USPQ at 
778; and Phillips Petroleum Co. v. U.S. Steel Corp .. 673 F.Supp. 1278, 1315,6USPQ2d 1065, 
1093 (D.Del. 1987), affd 9 865 F.2d 1247, 9 USPQ2d 1461 (Fed. Cir. 1989). 

The substantial uncertainties raised by Onda and Huse have not been addressed by the 
Examiner. No objective scientific work has been made of record to resolve or explain them. 
Read in their entirety, as they should, Onda and Huse point out problems about making and using 
some bacteriophage coat protein fusions. Even if one skilled in this field were to read Onda and 
Huse selectively and disregard their warnings, there is still nothing in the art relied on to suggest 
that one could make or use a scTCR fusion to a bacteriophage coat protein. Even Barbas 
admitted that there was uncertainty about what coat protein parts could be manipulated for phage 
assembly. 

In marked contrast, Appellants have demonstrated that it is possible to fuse a 
bacteriophage coat protein to the scTCR and obtain fully soluble and functional fusion protein. 
See Appellants' patent specification at Example 1 (showing construction of soluble scTCR fusion 
proteins); Examples 2-3 (production of special vectors to make the scTCR fusion proteins); 
Example s 4-5 (expression of Appellants 1 soluble scTCR fusions); Example 6 (purification of 
the soluble scTCR fusion proteins); Examples 7-11 and 16 (characterization of particular scTCR 
fusion proteins); and Example 15 (analysis of a bacteriophage library expressing Appellants' 
scTCR). 
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Accordingly, the §103 rejection fails both prongs of the Federal Circuit test for 
determining obviousness. See In re Vaeck , supra; and In re Dow Chemical Co .. supra. 
It is submitted that the Board reverse the obviousness rejection in light of this test. 

Whether or not the Examiner is taking the position that it would be obvious to try 
to make the claimed scTCR fusion proteins, both the Board and Federal Circuit have 
made it quite clear that this is not a burden that Appellants must bear. In particular, the 
Court in In re O'Farrell 7 USPQ 2d 1673 (1988) held at page 1681: 

The admonition that "obvious to try" is not the standard under §103 has been 
directed mainly at two kinds of error. In some cases, what would have been 
"obvious to try" would have been to vary all parameters or try each of numerous 
possible choices until one possibly arrived at a successful result, where the prior 
art gave either no indication of which parameters were critical or no direction as 
to which of many possible choices is likely to be successful. 

See also Ex parte Old . 229 USPQ 196, 200 (1985). 

While the Court in In re O'Farrell went on to state that while obviousness does not 
require absolute predictability of success, what is required under §103 is a reasonable 
expectation of success. 

Thus whether those in the field may have been tempted to fuse a bacteriophage 
coat protein to a scTCR, the field would have been cautioned from doing so in view of 
the warnings of Onda, Huse and to some extent even Barbas. The Barbas patent does not 
disclose or suggest any scTCR fusion to the coat protein. Chung does nothing to remedy 
these defects. In short, the field's unsuccessful experience with some amounts of 
antibody heterodimers (Huse) and some single-chain constructs (Onda), provides at worst 
no basis for believing that fusion of a bacteriophage coat protein to an scTCR will work 
and at best, a reason to doubt that such a fusion will result in a fully soluble and 
functional protein. 
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iii) No teaching or suggestion that Chung's "anchor" fragment could be 
substituted with Barbas' bacteriophage coat protein 

Chung reported TCRs linked to a cell membrane anchor (glycosysl phosphatidylinositol 
(GPI) or murine CD3 C, chain). The anchor apparently helps to express the single-chain TCRs. 
The anchor molecules are entirely different from the coat proteins of Barbas both in terms of 
chemical structure and function. For example, Chung's anchors are hydrophobic cell membrane 
proteins while those of Barbas are relatively more hydrophilic bacteriophage coat components. 
Chung's anchor apparently plants the scTCR in the membrane while the coat envelops the phage. 

The Examiner has pointed to no teaching or suggestion in the cited art that Chung's 
anchor molecules could be substituted with Barbas* bacteriophage coat proteins. The 
obviousness rejection falls far short of establishing any nexus between Chung's anchors, which 
are attached to his scTCRs, and the coat proteins reported by Barbas. 

iv) Objective Evidence of Non-obviousness 

In addition to the lack of a prima facie case of obviousness, the strong objective 
evidence of non-obviousness presented during prosecution of this case further compels 
allowance of the claims. 

Evidence of such objective indicia of non-obviousness, the so-called "secondary 
considerations" must be considered in all obviousness determinations. Stratoflex, Inc. v. 
Aeroquip Corp.. 713 F.2d 1530, 1538-1539 (1983): 

Indeed, evidence of secondary consideration may often be the most probative and 
cogent evidence in the record. It may often establish that an invention appearing 
to have been obvious in light of the prior art was not. It is to be considered as part 
of all the evidence, not just when the decision-maker remains in doubt after 
reviewing the art. 

See also Graham v. John Deere , 383 U.S. 1, 148 USPQ 459 (1966). 



18 



This standard set forth by the Federal Circuit applies not only during litigation of 
issued patents, but to a determination of patentability during ex parte prosecution as well. 
In re Semaken 702 F.2d. 989 217 USPQ 1, 7 (Fed. Cir. 1983). However, in the instant 
case, the Examiner is not properly considered evidence of "long felt need and failure of 
others" in maintaining the present § 103 rejection. 

Specifically, Appellants' provided the Holler reference as indicating that the field 
longed to make the claimed fusion proteins but could not. Holler reported that phage 
display had not yet proven successful in making scTCRs despite what he saw as 
extensive structural similarity between antibodies and TCR V regions. Appellants' 
invention addressed this need and succeeded by providing soluble fusion molecules with 
a bacteriophage coat protein linked to the scTCR. The Holler reference is highly 
probative of the difficulties the field had in making these molecules and should be given 
substantial weight by Examiner Schwadron. MPEP 716.01(b). 

The Examiner completely dismissed the Holler reference on grounds that "Holler 
et al. simply not familiar with the prior art". See Part D, above. That is no basis for 
disregarding the Holler's statement that the field wanted but failed to produce the claimed 
invention. Appellants are under no burden to provide evidence of Holler's knowledge of 
the art in order to have the reference considered as objective indicia of non-obviousness. 
See Stratoflex, Inc. v. Aeroquip Corp. , 713 F.2d 1530, 1538-1539(1983); In re Sernaker , 
702 F.2d. 989 217 USPQ 1, 7 (Fed. Cir. 1983); and MPEP 716.01(b). 

Moreover, Examiner Schwadron's citation of Weidanz et al. to support his 
disregard of Holler is clearly improper. That reference is not prior art and cannot serve as 
a basis for ignoring Holler or substantiating the obviousness rejection on appeal. 

It is requested that the Board consider Holler as objective evidence that workers in 
field wanted, but could not make, the claimed invention. 

CONCLUSIONS 
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For the Examiner's prima facie case to stand, he has the burden of showing that: 
1) The cited references disclose or suggest fusing a bacteriophage coat protein to a 
scTCR; 2) there is a settled role for the bacteriophage coat protein in making fusion 
proteins; and that 3) one could fuse Chung's scTCR to Barbas' coat protein with a 
reasonable expectation of success. These points have not been made by the Examiner. 
As discussed above, Barbas does not disclose scTCRs. Moreover, there was significant 
uncertainty in the field about whether it was possible to fuse a bacteriophage coat protein 
to a scTCR as exemplified by Huse and Onda. In view of the cited art and in 
consideration of the Examiner's position, it could be argued that one might be motivated 
to test fusing the bacteriophage coat protein to Chung's scTCR in the hope of producing a 
soluble and function protein. But this is not the legal standard required by our case law. 
It is without a doubt not obvious from the art of record to make the claimed invention of a 
scTCR fused to a bacteriophage coat protein. 

Appellants submit that they have overcome the Examiner's obviousness rejection 
in the view of all the facts and argument of record in this case. Simply put, one of skill in 
this area would not be able to predict, with any reasonable expectation of success, how to 
make and use the claimed invention. 

Importantly, Appellants have provided experimental evidence clearly showing that it is 
indeed possible to make and use scTCR-bacteriophage coat protein fusions. See Examples 1-11, 
15 and 16 as discussed above. 

In summary, Appellants submit that the instant invention is both novel and unobvious. 
The arguments set forth above establish that non-obviousness. 
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Although it is not believed that the present submission requires any fee for 
consideration by the Office, the Examiner is authorized to charge such fee to our deposit 
account 04-1105 should such fee be deemed necessary. 



Date: A*j*%r~<T J? 0*f 



Respectfully submitted, 




Robert L. Buchanan (Reg. No. 40,927) 

EDWARDS & ANGELL, LLP 

P. O. Box 55874 

Boston, MA 02205 

Tel. (617) 439-4444 

Fax (617) 439-4170/7748 

Customer No.: 21874 
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EXHIBIT A 
Claims 1, 2, 4, 7. 8, 14. 67, 69, 71 and 72 



on appeal 



What is claimed is: 

1 . A soluble fusion protein comprising a bacteriophage coat protein covalently linked to 
a single-chain T cell receptor comprising an antigen binding pocket, wherein the single-chain T 
cell receptor comprises a V-ot region covalently linked to a V-p region by a peptide linker 
sequence that effectively positions the V-a region and the V-p region to form the antigen 
binding pocket, the soluble fusion protein further comprising a C-P region fragment. 

2. The soluble fusion protein of claim 1, wherein the C-terminus of the V-a region is 
covalently linked by the peptide linker sequence to the N-terminus of V-P region. 

4. The soluble fusion protein of claim 2 wherein the C-P region fragment is covalently 
linked between the C-terminus of the V-P region and the N-terminus of the bacteriophage coat 
protein. 

7. The soluble fusion protein of claim 2, wherein the peptide linker sequence 
contains from approximately 2 to 20 amino acids. 

8. The soluble fusion protein of claim 1 , wherein the bacteriophage coat protein is 
gene III or gene VIII protein. 

14. A soluble fusion protein comprising covalently linked in sequence: 1) a V-a 
region, 2) a peptide linker sequence, 3) a V-p region covalently linked to a C-p region fragment, 
and 4) a bacteriophage gene VIII protein, wherein the peptide linker sequence effectively 
positions the V-a region and the V-p region to form an antigen binding pocket. 

67. The soluble fusion protein of claim 1, wherein the C-terminus of the V-p region 
is covalently linked to the N-terminus of a C-p region fragment. 
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69. The soluble fusion protein of claim 1, wherein the V-ot region and the V-p region 
are about 200 to 400 amino acids in length. 

71 . The soluble fusion protein of claim 1, wherein the C-p region fragment is about 
50 to 126 amino acids in length. 

72. The soluble fusion protein of claim 70, wherein the C-P region fragment does not 
include a cysteine residue corresponding to position 127 of a full-length C-p region. 



#453669 



23 



.5023- 17«7/9iV 1491 2-391 4602.00/0 

r>£ JotwfeAi. or boiu»ou»Y VeL 149. 9014-3030* No. 12. Decenber 15. 1003 

3opyrt£xe 1993 by The American Aaaoctattcn aT tnmwnofagtsts Printed In U.S. A. 
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WILLIAM D. HUSE. I# TIMOTHY J. STINCHCOMBE.* SCOTT M. GLASER/ LISA STARR/ 
MICHAEL MacLEAN/ KARL E. HELLSTR6M/ INGEGERD HELLSTROM,* and DALE E. YELTON* 

From 'Ixsys. Inc.. San Diego, CA 92121: ^Department of Immunology. The Scripps Research Institute. La JoUa. CA 92037; 
and 'Bristol-Myers Squibb Pharmaceutical Research Institute. Seattle, WA 98121 



We describe the application of a novel filamentous 
phage vector system suitable for efficient screening 
and production of F(ab) antibody fragments* The 
vector system can concurrently produce free F(ab) 
fragments and F(ab) displayed on the surface of Ml 3 
bacteriophage via a V„C H l-pVIII fusion protein. 
When expressed in a supO (nonsuppressor) strain of 
Escherichia colt free F(ab) can be produced. Anti- 
body F(ab) fragments are secreted into culture me- 
dium at concentrations up to 0.3 mg/liter and con- 
veniently subjected to detailed analysis with little 
or no purification. Higher concentrations of F(ab) 
'approximately 10 mg/liter) were found to accumu- 
late in the periplasmic space. In this report the 
vector system is shown to produce correctly folded 
and assembled F(ab) fragments of chimeric L6, a 
tnAb against a tumor-associated Ag expressed by 
many human carcinomas. 

Until recently mAb have been primarily produced in 
mammalian cells. The slow growth rates and difficulty 
n genetically manipulating antibody genes expressed in 
mammalian cells have motivated development of meth- 
xls to express antibody genes In simpler organisms. Mo- 
ecular cloning techniques in bacteria have facilitated the 
production and manipulation of antibody fragments, ln- 
:reaslngly aiding the search for useful antibodies (1.2). 
Vfethods have been reported for the expression of anti- 
body fragments in Escherichia colt using plasmlds (3- 
T), bacteriophage X (8-11). and more recently the fila- 
mentous phage M13 (12-17). Very large combinatorial 
tbraries of 10 5 to 10 9 distinct antibody specif icities can 
created in microorganisms, far greater than can be 
ichieved with hybridoma cell fusion methods (9, 10). 
-lowever. the rapid identification, isolation, and, if nec- 
essary, modification of antibodies with the goal of im- 
proving affinity or redirecting specificity for Ag depends 
leavlly on the availability of powerful screening methods 
x>th in terms of sampling large numbers of antibody 
ragments and evaluating certain aspects of antibody 
binding. Hence, an antibody expression vector that per- 
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mlts rapid cloning and mutagenesis of antibody V region 
genes and produces sufficient levels of antibody for bio- 
chemical analysis would be highly desirable. 

Because antibody fragments can be displayed oh the 
surface of filamentous phage, bacteriophage M 13 vectors 
are proving particularly valuable in creating and screen- 
ing sequence libraries for antibody fragments of interest. 
Briefly, fusion proteins are created by inserting DNA 
encoding an antibody fragment in front of a phage coat 
protein gene (18. 19). The fusion proteins become an- 
chored In the phage coat via the coat protein and antibody 
sequence is displayed at the phage surface. Phage-bear- 
ing antibody sequences of interest can be detected by Ag 
binding and isolated in Infectious form (13, 14). In addi- 
tion to antibodies the incorporation of malarial protein 
(20), growth hormone (21), and a hexapeptlde library (22) 
into the coat proteins of filamentous phage has been 
reported. 

We have developed an M13 filamentous phage vector 
system that can produce and display F(ab) as a fusion 
product to pVm coat protein and can also synthesize free 
F(ab) in quantity. We report here the production of chi- 
meric L6 antibody F(ab) fragments in these M13-derived 
vectors. L6 Is a mAb against a tumor-associated cell 
surface Ag expressed by many human carcinomas (23). 
L6 has been shown capable of lyslng cancer cells in vitro 
(24) and is currently the subject of clinical trials (25). 

MATERIALS AND METHODS 

Construction of bacteriophage M131XL6Q4 for expression of L6 
Ftabl Restriction enzymes, calf Intestine alkaline phosphatase, T4 
polynucleotide kinase. T4 DNA polymerase, and T4 DNA llgase were 
purchased from Boehrlnger-Mannhelra (Indianapolis. IN). Total RNA 
was Isolated from the chimeric LS- secreting cell line described by 
Fell et al. (26) by the guanidlntum thtocyanate-phenol/chloroform 
method (27). First strand cDN A was synthesized using oligo dT and 
BRL Superscript reverse transcriptase (GIBCO BRL. Grand Island, 
NY) and PCR a amplification of LB H chain (V^C„1 ) and L chain (V L - 
O) sequences was performed by the method of Saikl et al. (28) as 
modified by Sastry et al. (29). The following primers were used for 
PCR amplification. The restriction endonuclease recognizing the 
boldface and underlined cloning site in the sequences Is Indicated 
within the parentheses: forward V H primer (Xhol): S'-CACTCTGGA- 
CCTG A GCTCGAQA AGCCTGG AG AG-3 ' : forward V L primer (Ncot) 

1 Abbreviations used In this paper: PCR, polymerase chain reaction: 
gvm. gene VTH coding for the major coat protein of M13 phage; pVHI. 
major coat protein of MIS phage; CmI H chain C region 1.: V^ L chain V 
region; IPTG, lsocropyl-S^thi^lartopyranoside; V H , H chain V region; 
MOPS. 3HN-morpbollno)propane sulfonic add: TES. 30 mM Trts-HCl. 2 
mM EOT A, 2% sucrose (w/v). pH 8.0: supE. strains of £. coll that carry a 
glutamtne Inserting amber (UAG) suppressor tRNA: MES. 20 mM 3-(JV- 
morphollno)propane sulfonic add. 2 mM EDTA, 20% sucrose (w/v). pH 
7.5; supO. strains of £. coil that do not carry a suppressor tRNA. 
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5 ' -^COCAACCAC^XlATGQCCCAAATTQTTCTCTCXCAGTCT-3' ; 
reverse Cwl prtoerTSprfF^Tr^^ 

GGGCTCTGC-3'; reverse Cc primer (Xbal): 5 ^XXXJirAACTCT; 
AGACTAACACTXrrCCCCTGTTCAACCT-3' . The V„-Cm1 and V t -C* 
chains were digested and llgated Into prepared Ml 31X3 1 and 
M13DC12 vectors (30) resulting In M13DC31LG-H and M13DC12L5-L. 
respectively (Fig. 1). Sequence corrections of the cloned L6 Vm-C^I 
and V L -0 chains were accomplished by site-directed mutagenesis 
of uracil-substituted ssDNA as described (31. 32). The correcting 
nucleotides are Indicated by the underlined sequences. Five N-ter- 
minal V M amino adds were Inadvertently omitted from the original 
L6 sequence Information resulting In an 1 1 -amino acid de letion upon 
cloning. One V H correction primer was 5 ' -Cf C TCCAGGcrr iy n C- 
ACCrcAGGT(XAGA GGCTTTTG(C/T)CAC-3' . This primer cor- 
rected the Xhoi cloning site back to the original L6 sequence and 
replaced V„ codons 7 to 10. The C/T mixed site Introduced an amino 
acid change in the leader sequence that was found to Increase 
expression. The following V M primer served to restor e the remaining 
six N-terminai amino acids 5 ' -<!AGCTCAGGTtXXAGA CTGCACC AAC> 
TOG ATCTGC KKX:ATQGCroOTTGGGC-3' . The foUowing V L primer 
served to~correct an Incorrect nucleo tide In the reverse C* PCR 
primer: 3' ACTCTCCCCTGTTGAAGCTCTT^ The H chain 

encoding M 1 31X3 1 L8-H and L chain encoding M13IX1 2L6-L vectors 
were combined by annealing as described (30) to form M131XL604 
(Fig. 1). M13IXL605 was derived from M131XL604 by mutation of 
the TAG (stop) codon. located between the H chain encoding se- 
quence and the pseudo wild-type gene VUI, to GGT (glycine) using 
the oligonucleotide 5 ' -CXJCCTTC AGC ACCGG ATCC ACT AGT-3' so 
that continual V„-C«l-pVIIl fusion protein would be made. DNA 
sequence analysis of ssDNA prepared from phage Isolates was per- 
formed with Sequenaae Version 2 according to the manufacturer 
(United States Biochemical. Cleveland. OH). 

Antibodies and reagents. The anti-Id mAb to L6 have been 
previously described (33). Anti-Id 1 is a 7 2b lsotype. anti-Id 3 la a 
72a lsotype. and both anti-Id 7 and antl-ld 13 arc 7 1 laotypes. 
Alkaline phosphatase ^conjugated anUbodles were purchased from 
Fisher Biotech (San Francisco. CA). Unconjugated goat anti-human 
k antibody was purchased from Caitag Laboratories (So. San Fran- 
cisco. CA). Rabbit antl-M13 IgG was purified by Sepharose-proteln 
A chromatography and exhaustively absorbed against whole E. colt 
The antl-M13 antibody was subsequently biotlnylated using o-bio- 
tlnoyl-camJnocaproic acid N-hydroxysucclnlmlde ester (Boehringer- 
Mannheim) using standard chemistries. Vectastaln avtdin-horse- 
radlsh peroxidase complex (Vector Laboratories. Burilngame. CA) 
and streptavldtn-alkallne phosphatase complex (Boehringer-Mann- 
helm) were used for second step reactions. 

Screening by replicate JMter lifts. M13IXL604 phage were plated 
at a low plaque density. F(ab) expression was Induced by overlaying 
the plate with a 0.45- M nitrocellulose filter (Schleicher and Schuell. . 
Keene. NH) soaked In 10 mM IPTG and Incubating at room temper- 
ature from 6 h to overnight- The filler was removed and placed In 
blocking buffer (Bloalte Diagnostics. San Diego. CA) to block nonspe- 
cific binding sites. Phage growth was resumed by Incubating the 
plate for an additional 2 h at 37*C and a second filter applied as 
described above. This procedure was repeated for the last filter and 
all filters were then blocked In blocking buffer. All monoclonal or 
polyclonal antibodies to be used for screening were diluted in block- 
ing buffer. Filters were probed with either alkaline phoaphatase- 
conjugated goat antibody to human X -chain, alkaline phosphatase- 
conjugated goat antibody to human 1 -chain, or antl-ld 3. which binds 
to L6 antibody, in the case of antl-ld 3. a secondary alkaline phos- 
phataac-conjugated goat antibody to mouse igr^a was used for de- 
tection. All filters were then washed three times for 10 mln with 25 
mM Trts. 0.137 M NaCl. 5 mM KCI. 0.9 mM CaClj, 0.5 mM MgCl,. 
and 0.05% Tween 20 (pH 7.4) and developed with alkaline phospha- 
tase substrate reagent (Blo-Rad. Richmond. CA). 

Flab) production and purification. For analytic scale production 
of F(ab) the supE amber suppressor strain XL- 1 (Stratagene. San 
Diego. CA) and the supO nonsuppressor strain MK30-3 (Boehringer- 
Mannheim) were each grown In 2X YT medium at 37*C until the 
cultures reached a density of 0.4 to 0.6 at OCW Each strain was 
then diluted 1/10 Into three culture tubes containing 3 ml 2X YT 
and Infected with 3 *u of high titer (10" plaque-forming units/ml) 
phage stock of M13DCL604. M13DO605. or M13IX31/tube and In- 
cubated with shaking for 3 h at 37*C. Protein synthesis was Induced 
by the addition of IPTG to a final concentration of 1 mM and shaking 
allowed to proceed for 10 to 14 h at ambient temperature. Peri- 
plasm Ic fractions were prepared essentially as described by Skcrra 
and Pluckthorn (5). The Infected cultures were centrtfuged for 10 

* M 1 31X3 1 and Ml 31X1 2 vectors are available at no charge from lxsys s 
San Diego, CA. 
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Figure 1. Construction of M 1 31XLG04 vector for expression of L6 Fab. 
The M13DCL604 vector product contains one copy of pseudo wild-type 
gVlfl (gVHIH and one wt Id- type gVTII (gvm) both downstream from the H 
chain encoding region. gVHF* has been aKered In sequence to reduce 
homologous recombination with gvm contained on the same vector. The 
presence of gVffl reduces the likeUhood of selection against certain fusion 
proteins as the result of compromised phage viability (34). 

mln at 4*C and the cleared culture supernatant was reserved. The 
bacterial pellet was reauspended In 60 ml cold TIES and digested on 
Ice for 10 mln after adding an equal volume of a cold freshly prepared 
solution containing 2 mg/ml lysozyme in TES. The perlplasmic space 
containing free F(ab) was fractionated by centrtf ugation at 9000 rpm 
for 10 mln at 4*C. The soluble perlplasmic fraction was retained and 
diluted In blocking buffer to adjust the concentration of F(*b) to be 
equivalent to that found in the cleared culture supernatant. 

For purification of L6 Ffab) * 1 -liter culture of MK30-3 was grown 
and Infected with M131XL604 as described above. The cells were 
harvested by centrtf ugation at 5800 x g for 10 mln at 4*C. The pellet 
was resuspended In 20 ml MES at ambient temperature. An equal 
volume of 2 mg/ml lysozyme In MES was added with mild vortexlng 
and the suspension incubated at ambient temperature for 10 mln. 
The soluble perlplasmic fraction was Isolated by centrlf ugation at 
9700 x g for 10 mln at 4*C The perlplasmic fraction was subjected 
to a second centrlf ugation at 12.000 x g f or 30 mln at 4*C. The 
cleared perlplasmic fraction was loaded onto a ma ci oprep 50 S 
support (Blo-Rad) washed extenstvely with 20 mM WOPS. pH 7.5 
and eluted with 20 mM MOPS, 120 mM NaCl. pH 7.5. The partially 
purified L6 F\&b) was concentrated by centrlf ugation in a Centricon 
30 device (Amlcon. Beverly. MA) and size fractionated on a Blogei P- 
60 (Blo-Rad) column equilibrated and eluted with 20 mM MOPS. 120 
mM NaCl. pH 7.5. Eluted L6 F(ab) fractions were assayed for binding 
to anti-Id 3 by EUSA. Fractions eluting at 6. 9. and 10 ml were then 
pooled and concentrated. Samples from each purification step were 
analyzed by SDS-PACE on a 10% nonreduclng gel followed by stain- 
ing with Coomassie brilliant blue. 
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EUSA characterisation of L6 Hob) produced by M13IXL604. 
U3DCL604 phage stocks were prepared for assay by overnight 
rfectlofi of XL-l In LB broth and 10 *g/mJ tetracycline to maintain 
xpresslon of F\ and 1 mM IPTG. The bacteria were removed by 
entrlf ugatlon (10 min in a microfuge) and the phage solution was 
ssayed for binding to L6-speclfic anU-ld 1.3.7. 13. and to an anU- 
-chain antibody by ELISA. All antibodies were coated onto Immulon 
! mlcrottter plates (Dynatech Laboratories. Chantllly, VA) at 10 Mg/ 
il In 0. 1 M NaHCOs. pH 8.S, overnight at 4°C. The antibody solution 
/as shaken out and the plates were blocked with 300 /il of specimen 
'locking buffer (Genetic Systems. Seattle. WA) for 1 h at room 
emperature. Plates were washed before use with 0.5% Tween 20 In 
.15 M NaCl and with the same solution be tw een Incubations. Phage 
llutions were dispensed Into microti ter plates In 100 mI of blocking 
uffer and incubated overnight. After washing. 100 >J of conjugate 
•locking buffer (Genetic Systems) containing 1 /ig/ml of blotlnylated 
abb4t antibodies to Ml 3 were Incubated 1 h at am Went temperature, 
ifter washing. 100 p] of Vectastaln avidln-HRP complex were in- 
ubated 30 min at room temperature.' After a final wash step. 1 00 
I of chromogenlc substrate (3.3'.3.S'-tetramethylbenzldine) In a 
ltrate/phosphate buffer were added. The reaction was stopped with 

00 pi of 3 N H^50 4 at various times ranging from 10 min to 2 h to 
chleye an optimum signal to background ratio for each different 
late -coating antibody. Plates were read on a mtcroplate reader 
3lotck. Burlington. VT) In dual channel mode at 450/630 nm. Ail 
ssay points were measured In duplicate. Ml 31X31 served as a 
.egative control. 

The expression of M131XL604. M13IXL605. and Ml 31X31 In 
act e rial strains MK30-3 and XL-l was also analyzed by ELISA. 
Culture supernatant and diluted periplasmlc fractions prepared from 
ifected XL- 1 and MK30-3 bacteria described above were serially 

1 luted In diluent (Bloelte Diagnostics), added to mlcrottter plates 
oated with anti-Id 3 and Incubated for 2 h at ambient temperature. 
6 F(ab) expressed In the culture medium or In the periplasmlc space 
fas detected with an alkaline phosphatase-conjugatedgoat antibody 
5 human « -chains. Expression of L6 F(ab) displayed on the phage 
urface was detected by incubating the captured sample first with 
he blotlnylated rabbit antibody to M13 and then with stxcptavldln- 
Ikallne phosphatase complex. The washed plates were developed 
>1th 6 mg/ml phenol phthaleln monophosphate In 0. 1 M amlnome* 
hylpropancdlol. 0.5 M Tris. and 0.1% NaN,. pH 1 0.2 (JBL Scientific, 
an Luis Obispo. CA) for 10 to 30 min. The reaction was stopped by 
ddltlon of one-third volume of cold 30 mM Tris base. 6 mM EDTA. 
nd the absorbance at 560 nm measured. 

RESULTS 

Construction of L6 F[ab) expression vector 
113IXL604. The construction of M13IXL604 is shown 
a Figure 1 . Total RNA was isolated from the chimeric 
6-secreting cell line and cDNA synthesized. After PCR 
mpllflcation of the cDNA using sequence-specific 
Timers, the chimeric L6 V L -0 L chain was digested to 
ompletion with Ncol and Xbal and cloned Into the Ncol/ 
flxzl site of Ml 31X1 2 to construct M13IX12L6-L. Slmi- 
irly, the chimeric L6 V h -Ch1 H chain PCR product was 
lgested to completion with Xhol and Spel and cloned 
ito the Xhol/Spel site of M13LX31 to construct 
113IX31L6-H. M13IX31 contains an amber stop codon 
>cated directly 5' of a modified gVHI gene (pseudo-gVin) 
ncoding mature Ml 3 pVHI major coat protein. Cloning 
nUbody V H or Vh-Ch1 regions into the Xhol/Spel site of 
1 131X31 abuts these antibody sequences in frame with 
he amber stop-pseudo gVIH sequence. Thus. 
U3IX31L6-H should produce L6 V„-C„l-pVni fusion 
roduct when expressed in an E. coll amber suppressor 
train such as XL-l and produce predominantly free L6 
'h-ChI protein when expressed In a nonsuppressor 
train such as MK30-3. The H and L chain vectors were 
hen recombined into the single M13IXL604 expression 
hage by annealing HindIU/T4 DNA polymerase -digested 
U31X12L6-L to M/ul/T4 DNA polymerase -digested 
1 1 3DC3 1 L6-H through the homologous regions f ound be- 
Arccn the Hlndm and MIul restriction sites contained In 
oth vectors (30). Correct L6 H and L chain sequences 



were confirmed by DNA sequence analysis. The vector 
M13IXL605, which contains a GGT codon coding for 
glycine in place of the amber stop found in M131XL604, 
was constructed by site-directed mutagenesis of 
M13IXL604. Thus. M13IXL605 should produce L6 V„- 
Cwl-pVTIl fusion product Irrespective of the bacterial host 
used for expression. 

Functional characterization and purification of 
M13lXL604-expressed L6 F(ab). M13DCL604 phage ex- 
pressing L6 F(ab) was initially characterized by replicate 
filter lift assays (Fig. 2). XL-l bacteria were infected at a 
low multiplicity of infection resulting in low plaque den- 
sity of phage. No positive plaques were detected by filter 
lift assay when probed with alkaline phosphatase-con- 
Jugated goat antibody to human XL chain (Fig. 2A). In 
contrast, numerous, superlmposable. uniform positive 
plaques -were detected In assays of human c -chain (Fig. 
2B) and anti-Id 3. which recognizes assembled L6 H and 
L chain (Fig. 2C). Thus, screening by replicate filter lifts 
allows for detection of functional F(ab) expressed in this 
system. 

To further evaluate the functional integrity of L6 F(ab) 
expressed by M13IXL604. phage displaying L6 F(ab) on 
the phage surface were assayed for binding to a panel of 
distinct mouse monoclonal anti-Id raised against the 
mouse mAb L6. ELISA experiments shown In Figure 3 
demonstrated that four anti-Id antibodies specific for L6 




Figure 2. Screening for L8 f(ab} reactivity by replicate filter lifts. 
M13IXL604 phage wm replated at low density Tor screening. Replicate 
filter lifla were prepared and probed with A) alkaline phosphataae-con- 
Jugated goat antibody to human X -chain, B) alkaline phosphataae-conju- 
tfated goat antibody to human < -chain, and C) antl-M 3 antibody. The 
filter In C was washed and Incubated with alkaline phoaphataae-conju- 
gated goat antibody to mouse lg72a. Filters were then washed and devel- 
oped with alkaline phosphatase substrate reagent. 
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are recognized by the fusion protein displayed on the 
M13IXL604 phage. In the ELISA format used, 
M13EXL604 phage were grown in the suppressor bacte- 
rial strain XL-1. and phage displaying Incorporated L6 
F(ab)-pVIIl protein are captured by solid phase anti-Id 
antibodies and detected with antl-M13 antibodies. Thus, 
only phage-bound L6 F(ab) is detected. The four anti-Id 
antibodies exhibit different binding specificities to L6 
(33). Anti-Id 1 and 7 bind to L6 L chain, 13 to H chain 
and 3 to assembled L and H chains. The binding of 
M131XL604 phage to an anti-*-chain antibody further 
confirmed the assembly of L6 L chain with L6 H chain- 
pVIII fusion protein to yield F(ab), because L6 L chains 
are of the k type (Fig. 3). Ml 31X31. which contains no H 
chain-encoding region served as a negative control for 
each anti-Id. A representative curve shows no Ml 31X31 
binding. The ELISA results in Figure 3 thus indicate the 
presence of both L6 L and H chains and confirm that 
proper folding and assembly of L and H chains on the 
phage surface have been achieved. 

Purification of L6 F(ab) fragments produced by 
M131XL604 phage grown in the nonsuppressor bacterial 
strain MK30-3 was accomplished by loading the crude 
periplasm!*: fraction prepared from a 1 -liter shake flask 
culture onto a cation exchange resin. The eluted material 
was concentrated and size fractionated by gel exclusion 
chromatography. Figure 4A shows the elution profile 
from the gel exclusion column as assayed by ELISA for 
functional binding to anti-Id 3 using anti-human * to 
detect captured L6 F(ab). The purified F(ab) was shown 
to have a M r of 42.7 kDa by SDS-PAGE, a value consistent 
with the 48-kDa size of known F(ab) fragments (Fig. 4B). 
Based therefore on both functional and physical proper- 
ties, the identity of the purified material is chimeric L6 
F(ab). 

Regulated expression of M13IXL604 production of 
F(ab) and phage displayed F{ab). The proper activity of 
Ml 3IXL604-expressed L6 F(ab) having been established, 
the functioning of the expression control system of 
M131XL604 was tested. This Involved comparing levels 
of fusion F(ab) with free F(ab) produced when phage were 
grown in suppressor strains (supE) vs nonsuppressor 
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Figure 3. ELISA characterization or L6 F(ab) produced by M 1 3IXL604. 
Various dilutions of phage expressing L6 F(ab) on the phage surface were 
evaluated for binding to anti-Id 1. 3. 7. and 13 and to an anti-human <- 
chain antibody. Bound phage were then detected as described in Mate- 
rials and Methods. Curves represent the average of triplicate assays. 
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Figure 4. Purification of L6 F(ab) expressed by M 1 31XL604- Infected 
MK30-3 bacteria. A) elution profile of L6 F{ab) fracUons assayed for 
binding to anti-Id 3 antibody by EUSA. Fractions eluting at 8. 9. and 10 
ml were then pooled and concentrated. B) SOS-PAGE analysis of purified 
Fab. Samples from each purification step were analyzed by SDS-PAGE 
on a 10% nonredudng gel followed by staining with Coomassle brilliant 
blue. The L6 chimeric F(ab) standard was prepared from whole IgG by 
papain digestion followed by protein A chromatography (Pierce. Rockford* 
IL). 

strains (supO) of bacteria, respectively. This comparison 
was made using ELISA assays employing anti-Id 3 to 
detect assembled L6 F(ab). F(ab) displayed on the surface 
of M 1 3 phage was detected using the M 1 3 assay described 
in Figure 2. M13IXL605. a variant construct of 
M13IXL604 that has the amber stop codon replaced with 
a glycine encoding codon and therefore always synthe- 
sizes V„-Ch1-pVTII protein, served as a control. 

When phage were grown in the supE strain XL- 1 . both 
M13IXL604 and M13IXL605 secreted phage displaying 
F(ab) into the culture medium (Fig. 5A) and, as expected, 
little or no mature phage displaying F(ab) was detected in 
the periplasmic space (Fig. 5B). In the supO strain 
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figure 5. Characterization of the 
3IXL604 expression control system, 
uld cultures of the amber suppres- 
straln XL-1 (supE) and the non- 
preaaor strain MK30-3 (supO) were 
b Infected with either M131XL604. 
300605. or M13DC31 and culture su- 
natant and pertplasmlc fractions pre- 
ed aa described In Materials and 
Jwds. Culture supernatant and peri- 
im from Infected cultures were as- 
ed for functional binding to anU-Id 3 
F(ab) displayed on phage surface (A to 
ind by total i£ to H). 
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C30-3. the amber stop codon in M13IXL604 should 
rvent fusion protein formation. Figure 5C shows phage 
iplaylng F(ab) to be sharply reduced in M13IXL604 vs 
3DCL605. The low signal seen In the M13DCL604 cul- 
*e medium is presumably caused by nonspecific asso- 
Uon of L6 F(ab) with phage particles and/or transla- 
nal readth rough. Again no mature phage displaying 
ib) was found in the periplasm (Fig. 5D). In examining 
r production of F(ab) by the anti-Id 3/*-chaln assay 
th M13IXL604 and M131XL605 secreted anti-«-reac- 
c material into the culture medium when expressed in 
her XL-1 or MK30-3 (Figs. 5E and 5G). Because the 
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efficiency of amber suppression can be widely variable, 
we expect that M131XL604 expressed in XL-1 should 
secrete free F(ab) in addition to F(ab) displayed on the 
surface of phage. Thus, the signal in Figure BE should 
represent some combination of the two molecules. Poly- 
ethylene glycol precipitation of both M13IXL604 and 
M13IXL605 phage from induced XL-1 culture superna- 
tants showed that approximately 95% of the anti-«-reac- 
tlve material remained In the polyethylene glycol-cleared 
supernatant (data not shown) Indicating that the predom- 
inant secreted product Is free L6 F(ab). Consistent with 
the view that M13IXL664 and M13DCL605 secrete free 



SCREENING AND PRODUCTION OF 

L6 F(ab), abundant amounts of soluble assembled F(ab) 
were detected In the periplasmlc space In both strains of 
E. coll (Fig. 5, Fand H). MK30-3 appeared to consistently 
produce higher concentrations of F(ab) than XL-1 . Quan- 
titative ELISA demonstrated that the concentration of 
free F(ab) produced by M13DCL604 grown in the supO 
strain MK30-3 approaches values up to 0.3 mg/llter In 
culture supernatant and concentrations approaching 10 
mg/llter within the periplasmlc space (data not shown). 

DISCUSSION 

Ml 31X31 (for H chains) and M131X12 (for L chains) are 
the M13 vectors that can receive polyclonal sets of V 
regions for constructing combinatorial libraries or indi- 
vidual antibody genes for V region production and/or 
mutagenesis (30). The system Is the first to allow differ- 
entially controllable production of free and fusion F(ab). 
Upon recombtnlng the two vectors, all of the control 
elements required for regulated expression of phage dis- 
played F(ab) or free F(ab) are contained in a single expres- 
sion vector. For phage display F(ab) is synthesized as a 
fusion product to pVHI coat protein, the major structural 
protein of the filamentous Ml 3 phage particle. In the 
Ml 3 bacteriophage the major coat protein pVUI is ex- 
pressed In several thousand copies per phage particle. 
Although this level of expression may be useful for pro- 
ducing F(ab) as F(ab)-pVIII fusion proteins, pVlH synthe- 
sized solely as a fusion product Is not likely to form a 
properly assembled coat as a result of steric hindrance 
by the much larger H chain polypeptide. We reasoned 
that If wild-type pVHI, In addition to F(ab)-pVm fusion 
product. Is available for assembly then the formation of 
a mature Infectious phage particle could occur. With this 
in mind, the H chain-gVIU expression vector Ml 31X31 
was constructed to contain two copies of gVIII. The copy 
that anchors F(ab) to the surface of M13 has had gVIII 
codons extensively altered (pseudo gVIII) to prevent re- 
combination with the wild-type gvm. 

To demonstrate the utility of these vectors we cloned 
the antibody V L -Gc- and V h -Ch 1 -encoding sequences from 
the chimeric L6 transfectoma cell line Into the Ml 31X1 2 
and Ml 31X31 vectors, respectively. The two vectors were 
recombined to create M13IXL604. M13IXL604 expresses 
a dicistronic message encoding both H and L chain L6 
sequences under transcriptional control of an inducible 
Lac promoter. An amber stop codon resides between the 
H chain-encoding region and the pseudo gVIII coat pro- 
tein. When grown in an amber suppressor strain [supE) 
of E. coll. M131XL604 was shown to produce L6 H chain- 
pVIII fusion protein resulting in phage-displayed F(ab) In 
addition to free F(ab). When free F(ab) was desired as the 
major end product, the phage was grown in a non sup- 
pressor strain (supO), The M13IXL605 vector, which 
solely produces F(ab)-pViII fusion protein, was con- 
structed to evaluate, by comparison, the M 1 31XL604 con- 
trol system. 

M13IXL605 was found to display slightly higher levels 
of phage-assoclated F(ab) than M13DCL604 when grown 
in the supE bacterial strain. This is likely attributed to a 
higher proportion of F(ab)-pVIH fusion protein relative to 
wild-type pVTU produced in the M131XL605- infected 
strain. In both the supE and supO bacterial strains 
M13IXL604 secreted chimeric L6 F(ab) at levels some- 
what higher than those of M13IXL605. Phage titers of 
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M13IX31-. M13KL604-, and M13KL605-infected cul- 
tures were found to decrease relative to the level of F(ab)- 
pVUI fusion protein incorporation (D. Yelton, unpub- 
lished observations). Taken together, these results sug- 
gest that a functionally viable phage particle may be able 
to tolerate a limited number of incorporated F(ab)-pVIlI 
fusion products and that the amount of F(ab) incorpo- 
rated into the phage coat may inversely affect phage titers 
and overall F(ab) yield. Furthermore, the appearance of 
free F(ab) in the culture media suggests that H chain- 
pVffl fusion protein unincorporated into phage is properly 
assembled with L chain and secreted as functional F(ab). 

The set of anti-Id antibodies raised against the murine 
m Ab L6 demonstrated that faithful expression and func- 
tion of bacterially produced chimeric L6 F(ab) occurs In 
the M 1 3IX 1 2 and M 1 31X3 1 vector system. L6 F(ab) activ- 
ity is readily detectable by either the nitrocellulose filter 
lift or ELISA formats. The anti-Id antibodies used in these 
studies serve as a convenient and informative model for 
antibody-Ag binding, inasmuch as the tumor Ag bound 
by L6 in vivo is yet to be purified or fully characterized. 

The Ml 31X31 and Ml 31X12 vector system described 
here can serve as a versatile, general purpose approach 
to F(ab) production and screening. F(ab) production in 
this system is sufficiently robust to permit multiple rep- 
licate filter lifts, a practical requirement for implement- 
ing various screening strategies. The vectors can be used 
to create combinatorial antibody libraries to identify 
novel antibodies. Screening of sufficiently large combi- 
natorial antibody libraries could potentially allow useful 
antibody fragments of murine or human origin to be 
isolated without the necessity to perform standard im- 
munization procedures (35). In addition, because ollgo- 
nucleotlde-directed mutagenesis Is convenient and highly 
efficient in Ml 3 these vectors are ideal for engineering 
antibodies with new properties. It is conceivable that the 
ability to create stable, high affinity human antibodies is 
a promising endeavor. 

In a companion publication (36), we show that the 
Ml 31X31 and M13DC12 antibody expression system 
lends itself to efficient antibody engineering by site-di- 
rected mutagenesis. Codon-based mutagenesis of L6 hy- 
pervariable regions proved effective in altering the fine 
specificity of L6 In a predefined manner. Thus the system 
allows mutagenesis, screening, and mutant F(ab) produc- 
tion to be accomplished very rapidly. 
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A PHAGE DISPLAY SYSTEM FOR DETECTION OF T CELL 
RECEPTOR-ANTIGEN INTERACTIONS 

TAKEBUMI ONDA,*tJ DRAKE LAFACE,*J§ GOTTFRIED BAIER,*|| 
THOMAS BRUNNER,* NAKAYUKI HONMA,1J TOSHIFUMI MIKAYAMA,!? 
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•Division of Cellular Immunology, La Jolla Institute for Allergy and Immunology, 1 1 149 N. Torrey 
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Macbashi Shi Gunma; Japan 

(First received 29 June 1995; accepted in revised form 30 June 1995) 

Abstract— The process of T cell recognition involves a complex set of interactions between the various, 
components of the TCR/MHC peptide trimolecular complex. We have developed a system for 
exploring the specific binding interactions contributed by the' constituent subunits of TCR complexes 
for components of their tigands. We utilized an M13 phage display system, designed for multivalent 
receptor display, to explore specific binding interactions between various TCRcc chains and specific 
antigen in the absence of MHC. The multivalent TCR-phage display system was sensitive enough to 
reveal some TCRa chains capable of binding directly to antigen with the same fine specificity shown 
by the MHC-restricted T cells from which the « chains were derived. Cross-specificity analysis using 
two antigen-binding TCRa chains derived from T cells with different polypeptide antigen specificities 
confirmed the fidelity of this binding. In mixtures of antigenrbinding and non-binding TCRa-dis- 
playing phage, specific selection was achieved at a starting frequency of 1/1000, suggesting that this 
system can be employed for selection and analysis of TCR-dispIaying phage libraries. While the 
binding specificities exhibited by these TCRs are unusual, they provide a novel perspective from 
which to study the specific binding interactions that constitute TCR antigen binding. 

Key words: Ml 3 filamentous phage, peptide binding, phage-display, TCR. 



INTRODUCTION 

Specific recognition by T lymphocytes is mediated by T 
cell antigen receptors (TCR) (Jorgensen et a/., 1992b; 
Chien and Davis, 1993). The most extensively studied 
TCR-ligand interactions involve TCRa/? heterodimers 
binding to peptide-MHC complexes. A number of stud- 
ies have focused on defining the intricate biochemical 
interactions between the various components of the TCR 
and MHC-peptide complexes. As a consequence of these 
investigations, it has been suggested that residues within 
the CDR3 regions of TCRa and fi chains interact with 
specific residues from the antigenic peptide fragment 
(Davis and Bjorkman, 1988; Engel and Hed rick, 1988; 
Danska et at., 1990; Jorgensen et at., 1992a). More recent 
studies have indicated that TCR interactions with peptide 
appear to be very specific, while interactions with MHC 
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are comparatively degenerate (Ehrich et at., 1993). The 
TCR-MHC interactions could be established in a variety 
of configurations with the same TCR and MHC and 
appear to be influenced by the interaction of the TCR 
with the peptide. Thus, while TCR interactions with resi- 
dues from both peptide (Engel and Hedrick, 1988; Dan- 
ska et at., 1990; Jorgensen et at., 1992a) and MHC 
(Ajitkumar et at,, 1988; Peccoud et at., 1990) appear to 
be essential for efficient T cell recognition, the dominant 
interactions are seemingly mediated by CDR3 residues 
associated with specific peptide (Ehrich et at., 1993). 

Although most studies have focused on conventional 
TCR binding of MHC-peptide ligands, a number of 
alternative TCR-ligand interactions have emerged, pro- 
viding the basis for further studies in TCR specificity. 
Examples include TCR ft chain-binding to superantigens 
(White et at., 1989; Gascoigne and Ames, 1991; White 
et at., 1993); TCRs which specifically bind peptide-free 
mycolic acid antigens in the context of CD I restricting 
elements (Porcelli et at., 1992); and TCRs with speci- 
ficities for carbohydrate moieties of post-translationally 
modified peptide antigens (Haumm et at., 1994; 
MichaSlsson etai, 1994). These recent studies of non- 
conventional TCR-ligand interactions lend credence to 
earlier studies of T cells which expressed afi TCRs capable 
of binding directly to hap ten-conjugates (Rao et a/.. 
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1984a, 1984b; Siliciano et <z/., 1986) or peptide antigens 
(Esch and Thomas, 1990) in the absence of MHC 
proteins. Interestingly, direct TCR-binding interactions 
with hapten or peptide ligands, in the absence of MHC, 
demonstrated relatively high affinity constants (KoS = 5 
x 10~ 5 — 6 x 10"*) when compared to the normal range 
of TCR-ligand affinities (Rao et a/., 1984b; Siliciano et 
aL, 1986; Esch and Thomas, 1990; Davis and Chien, 
1993). 

Studies of TCR that directly bind nitrophenol haptens 
suggested that the specificity was mediated by the TCRa 
subunit in a manner independent of TCR0 (Kuchroo et 
al. t 1991). Similar studies by our group and others have 
suggested the possibility that a subset of TCRa molecules 
may bind directly lo antigen for which the T ceU encoding 
ihc TCR protein is specific (Bissonnetfe et aL, 1991; 
Green et aL 1991; Mori et o/., 1 993). While these unusual 
TCR-ligand interactions may represent a relatively small 
subset of TCR specificities, they are nonetheless 
intriguing because they provide an alternative approach 
with which to explore the many complex interactions that 
mediate specific binding. Since the constituent a and /? 
chains of TCR may bind different residues of the peptide 
ligand component in a relatively autonomous manner 
(Jorgensen et al„ 1992a, 1992b), we reasoned that it 
should be feasible to study directly the specific inter- 
actions of TCRa chains with antigenic peptide. An 
elemental analysis of the specific interactions that con- 
stitute the complex binding of TCR with MHC-peptide 
ligands would be extremely useful. While these studies 
would not refute existing theories of conventional TCR- 
antigen binding, they would provide a unique perspective 
from which to study the individual specific binding inter- 
actions between the various subunits of the TCR- 
MHC/peptide complex which contribute to the overall 
TCR binding specificity. 

We employed the filamentous phage display system for 
the study of immunological receptors (Kang et a/., 1991; 
Barbas and Lerner, 1991b) to directly assess specific bind- 
ing interactions of several TCRa polypeptides with anti- 
genic peptide and globular proteins in the absence of 
MHC. The multivalent phage display system, utilized to 
enhance avidity, was optimally suited to study inter- 
actions of TCR molecules directly with their peptide 
ligand, as the binding affinities are relatively low com- 
pared to immunoglobulin receptors. 



EXPERIMENTAL PROCEDURES 

Reagents, strains, vectors 

The Escherichia coli XLl-Blue (tet^ and VCSM13 
helper phage (kan r ) were purchased from Slratagene. All 
enzymes were purchased from Boehringer Mannheim 
and Promega. The phagemid vector pComb8 (Kang et 
a/., 1991) utilized to produce clones encoding TCRa 
chains fused to the cpVIII of M13 filamentous phage 
was generously provided by Denise Burton (The Scripps 
Research Institute). All molecular biology procedures 
were performed according to conventional techniques as 



described in Molecular Cloning: a Laboratory Manual 
(Sambrook et aL, 1989) unless indicated otherwise. 

Peptides and antibodies 

Poly 1 8 peptides were kindly provided by Dr Bhagjrath 
Singh (University of Western Ontario, Canada). Bee 
venom phospholipase A 2 and bovine phospholipase A 2 
were purchased from Sigma. Cell lines producing hamster 
anti-mouse TCRa (H28-710.I6) and anti-TCR0 (H57- 
597 hamster IgG) were kindly provided by Dr Ralph 
Kubo (Cytel, La Jolla, CA, U.S.A.). Supernatants were 
concentrated by precipitation with 45% saturated 
ammonium chloride followed by dialysis, and the anti- 
bodies were purified by protein A chromatography. 

The TCRa cDNA and clones 

Full-length cDNA sequences encoding the A I . I TCRa 
(Green et at., 1991), 5C.C7 TCRa (Fink et at., 1986) 
(provided by S. Hedrick, University of California, San 
Diego, CA, U.S.A.) and the 3B3 TCRa (Mori et al. f 1993) 
(T. Mikayama, unpublished sequence), respectively, were 
used as templates in the polymerase chain reaction (PCR) 
subcloning procedures. Fifty nanograms of cDNA tem- 
plate was mixed with 60 pmol of each primer (Table I), 
200 mM dNTP, Promega Taq polymerase buffer con- 
taining 1.5 mM MgCl 2 and five units of Taq polymerase 
in a final volume of 100 ml. Amplification was carried out 
for 20-30 cycles on a TwinBlock thermal cycler (Ericomp 
Inc., San Diego, CA, U.S.A.) under the following con- 
ditions: 1 min of denaturation at 94°C, 2 nun of primer 
annealing at 50°C, 1 min of elongation at 72°C, followed 
by a final elongation at 72°C for 15 min. Amplified prod- 
ucts were hydrolysed (4 hr at 37°C) at the Xhol and Xbal 
sites (25 U enzyme//ig PCR fragment) encoded by the 
primers, size-fractionated by agarose gel electrophoresis 
and purified using a Gcneclean procedure (Bio 101 Inc., 
La Jolla, CA, U.S.A.). The PCR primers were designed 
to produce TCRa cDN A sequences that could be ligated 
into the pComb8 vector in frame with the pelB leader 
sequence at the 5' end and fused to the N-terminus of the 
cpVIII protein on the 3' end. The purified Xhol - Xbal 
insert was directionally cloned into the phagemid vector 
pComb8 at the Xhol and Spel sites and transformants 
were screened with internal oligonucleotide probes in 
colony lifts. Plasmid DNA from positive clones were 
sequenced by the dideoxy method using Scquenase 2.0 
(USB Corp., Cleveland, OH, U.S.A.), analysed using 
the MacVector analysis program (IBI, New H^ven, CT, 
U.S.A.) confirming identity, orientation and in-frame 
cloning of TCRa cDNAs. 

Recombinant phage preparation 

Production of recombinant filamentous phage dis- 
playing TCRa chains was carried out essentially as pre- 
viously described for the production of phage displaying 
immunoglobulin receptors (Kangef aL, 1991; Barbas and 
Lerner, 1 99 lb). Briefly, E. coli XLl-Blue (Stratagene) 
cells were transformed with recombinant TCRa/pComb8 
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Variable region primers 

TCR Va (sense) 
5C.C7 

TCR V« (sense) 
3B3 

TCR Va (sense) 



Constant region primers 

TCR-VJC 

TCR Ca (antisense) 

TCR-VJ 

Truncated Cor (antisense) 



Al 



y-fiAAGAG CTCGAGA TGAAATCC TGAGT-3' 
5-TGGACACICQAQATGCAGAGGAAC( 



5 'XKXKXGCTCG AG ATG AAATCCTTXj AGTGTTTT A CT A 
GTGGTCCTGTGGCTCCAGTT AAACTGCGTGAGG A GC- 
CAGCAGCAAGTGCAGCAGAGTCCTGCA-3' 



5'-GCTGTCIO£<^<^CACCG^ 
5MZAGGAGICIAaAGCCACCG<XACCGTCGATO 



Restriction sites used 



phagemid and selected on LB plates containing 100 /ig/ml 
ampicillin. Fresh ampicillin-resistant colonies were 
grown in liquid culture on a shaker at 37°C in super- 
broth (SB) for 1-2 hr in the presence of ampicillin (50 
/ig/ml) to select for cells bearing phagemid and tetra- 
cycline (10 jig/ml) to induce the F episome. The cultures 
were then incubated with 1 mM i so propyl /?-r>thio- 
galactoside (DPTG) for 1 hr to induce production of the 
TCR-cpVTII fusion protein and then superinfected with 
VCSM13 helper phage (10 12 pfu) (Stratagcne) to give a 
final phage/ceil ratio of 10-20:1. The cells were shaken 
(37°C) for an additional 2 hr and then selected for 
helper phage induced antibiotic resistance (kanamycin, 
70 /ig/ml) and grown overnight, 37"C, 250 rpm. Phage 
supernatant was cleared by centrifugation of the cultures 
(4000 rpm in a GSA rotor, Sorvall, at 4°Q. The phage 
were precipitated by adding 3 % (w/v) NaCl and 4% (w/v) 
polyethylene glycol 8000 for 1 hr at 4*C and centrifuged 
at 9000 rpm in a Sorvall GSA rotor at 4°C. The phage 
pellets were resuspended in PBS to 1/25 of the original 
volume and aggregated phage and debris were removed 
by centrifugation for 5 min in a benchtop micro- 
centrifuge. Phage supernatants were transferred to fresh 
tubes and stored in aliquots at — 20 3 C. 

Affinity selection panning 

The panning procedure utilized to screen binding of 
TCR displayed on the surface of filamentous phage was 
a modification of the original protocol described by 
Parmley and Smith (1988). Wells of a microti tration 
ELISA plate (Immulon 2, Dynatech) were coated with 
peptide or phospholipase A 2 (Sigma) in sodium bicar- 
bonate (0.1 M, pH 9.5) at 4°C overnight in a volume of 
50 fi\ at the concentrations indicated in the figure legends. 
Antibodies were coated onto plates using Tris (50 mM, 
pH 9) at 4°C overnight in a volume of 50 /il at the 
concentrations indicated in the figure legends. The wells 
were washed twice with PBS (Dulbccco's, pH 7.4) and 
blocked by completely filling the well with 3% bovine 
serum albumin (BSA) (Sigma, fraction V) and incubated 
for 1 hr at 37°C. The blocking solution was flicked out, 



rinsed with PBS and 50 /il recombinant phage was added 
to each well (typically 10'°-10 M CFU unless otherwise 
indicated) and incubated for 2 hr at 37°C. The phage 
were then removed and the wells were washed 10 x (pip- 
etting up and down to wash) with TBS/Tween (50 mM 
Tris-HCl, pH 7.5, 150 mM NaCl, 0.05% Tween 20) over 
a period of 1 hr allowing the wells to remain filled between 
washings. Washing was done carefully to ensure simi- 
larity between samples, experiments and individuals. 
Adherent phages were eluted with 50 pi elution buffer 
(0. 1 M HC1 adjusted to Ph 2.2 with solid glycine, I mg/ml 
BSA) for 10 min at room temperature and neutralized 
with 6 ml of I M Tris base. Eluted phage were used to 
infect fresh E. coli XLI-Blue cells, OD«» = I (grown in 
the presence of 10 /ig/ml tetracycline to induce F' episome 
expression) for 15 min at 37°C followed by selection 
on LB/ampicillin plates to assess the number of bound 
recombinant phage per ml of eluted phage. 

Immunoblot of TCRx-p VIII fusion proteins 

Phagemid DNA of TCR-pComb8 or control con- 
structs were transformed in XLI-Blue cells and trans- 
form ants were inoculated in 25 ml 2 xTY medium 
containing 50 /ig/ml ampicillin. Protein expression was 
induced with 0. 1 mM IPTG at an OD«o = 1 for 7 hr at 
26°C. Cells were harvested by centrifugation. sonicated 
on ice for I min in PBS, 1 mM PMSF, 1% NP-40 and 
the resulting lysate was centrifuged (B.OOOg, 4 C C) to 
remove insoluble debris. The soluble cellular proteins 
were analyzed by SDS^PAGE and immunoblotted with 
an anti-TCRa antibody (H28-7 10. 16). 

Presentation assay 

The TCR specificity of A 1.1 hybridoma cells for our 
poly 18* related peptide analogues was assessed by mea- 
suring the 1L-2 responses of Al .1 cells to peptide-pulsed 
BALB/c (I-A d ) spleen cells (y-irradiated 2000 rad). Super- 
natants were collected following overnight peptide stimu- 
lation and the relative production of IL-2 was assessed 
in a CTLL assay. Starved CTLL cells were washed and 
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incubated in various dilutions of sample or control super- 
natant* (100 /il) in 96- well plates (Costar) at 37°C for 
16 hr. The cells were then pulsed with [ 3 HJ- thymidine 
(1 /iCi/well) and harvested an additional 4-6 hr later. The 
IL-2-dependent proliferation was assessed by the relative 
level of [ 3 H]-thymidine incorporation (cpm). 

Hybridization of replica plates 

Following two rounds of affinity selection panning 
(described above), ampicillin-resistant CFUs were blot- 
led onto replica filters as described (Sam brook et aL, 
1989). The replica filters were denatured (0.5 N NaOH, 
1.5 M NaCl) and neutralized [1.5 M NaCI, 0.5 M Tris- 
HCL (pH 7.4)J and then fixed (80°C, 2 hr). The filters 
were then hybridized with a 5' ^-labelled A 1.1 TGR V<x 
specific probe (5-GAAGAGCTCGAGATGAAATC- 
CTTGAGT-3'). 

RESULTS 
The TCR phage display 

We previously presented evidence suggesting that the 
TCRot molecule from the A 1.1 T cell hybridoma (Vot 1.2, 
JaTA65) may bind directly to antigenic peptides for 
which this T cell is specific (Bissonnette et al. , 1 99 1 ; Green 
et al.> 1991). The Ml 3 filamentous phage display system 
(Kang et aL, 1991; Barbas et aL y 1991a; Barbas and 
Lerner, 1991b) provided the means to directly assess the 
ability of Al.l TCRot polypeptides to specifically bind 
to peptide antigens. We utilized the phagemid vector, 
pCombS (Kang et al. f 1991) to generate recombinant 
phage-displaying TCR proteins fused to the N-terminus 
of the coat protein VIII (cpVIII). The cpVlIl molecules 
form the capsid coat during phage assembly (utilizing 
about 2500 cpVIII proteins per phage particle) allowing 
for multiple receptors to be displayed on the surface of 
the recombinant phage (Felici el aL, 1991; Greenwood et 
al., 1991; Kang et a/., 1991). This multivalent expression 
can greatly enhance the avidity, which facilitates the 
study of binding interactions with moderate affinity. 

The PCR-generatcd fragments encoding the VJ- or 
VJC-domains of the A 1 . 1 TCRot molecule were subcloned 
into the pComb8 phagemid vector to generate recom- 
binant phage displaying multiple copies of the Al.l 
TCRot molecule on the surface. This is illustrated in Fig. 
1 (see Experimental procedures). Extracts of cells trans- 
formed with various recombinant or control phagemids 
were subjected to SDS-PAGE and immunoblotted with 
an anti-TCR Cot specific antibody (H28.7t0.16) to con- 
firm that TCRa/cpVIII fusion proteins were produced. 
As shown in Fig. 2A, a protein that migrated with the 
predicted molecular weight for the TCR-VJC/cpVIIl 
fusion protein, Mr 36.5 LDa, was detected from extracts 
of cells transformed with the Al . 1 TCR-VJQt construct. 
A second protein of M r 34.5 kDa was also detected, which 
probably represents a partially degraded form of the 
fusion protein, since it was not observed in the other 
lanes. No bands were observed from extracts of cells 
transformed with either the parental phagemid (pCombS, 
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Fig. 1. Schematic protocol for the generation of chimeric con- 
structs. (A) Full-length cDNA sequences encoding various 
TCRat chains were used as templates in the polymerase chain 
reaction (PCR) subcloning procedures. The 5' PCR primers 
(arrows) contained an Xhol site designed to generate a PCR 
fragment that could be ligated into the pCombS vector in frame 
with the pelB leader sequence. The 3' primers (arrows) con- 
tained a sequence encoding a flexible linker (GOGS) and an 
Xba\ site to generate a fusion gene with the major cpVIII upon 
ligation. The PCR fragments were digested with XbaX and Xhoh 
purified and directionaily cloned into the phagemid vector 
pCombS. (B) The recombinant TCRar-pCombS phagemids 
contained an amp' gene utilized to facilitate drug selection of 
recombinant phage. Production of the recombinant 
TCRa/cpVIII fusion protein was driven by the inducible LacZ 
promoter following transformation of competent XL 1 -Blue 
cells (Stratagene). Transport into the periplasmic space during 
production was mediated by the pelB leader sequence. (C) The 
predicted structure of the recombinant TCRat/cpVllJ fusion 
proteins consisted of a TCRa polypeptide and a flexible linker 
(GGGS) fused to the N-terminus of the cpVIII molecule. The 
pelB leader sequence at the N-terminal end is cleaved off fol- 
lowing transport into the periplasmic space. The recombinant 
fusion protein was incorporated into the viral capsid coat dur- 
ing phage assembly to generate recombinant M13 phage dis- 
playing TCR molecules on the surface. 



vector only) or the Al . 1 TCR VJa construct, which both 
lack the Ca domain recognized by the H28.710.16 anti- 
body used for detection. 
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Fig. 2. Expression of TCRa/pComb8 recombinant fusion 
proteins. (A) Western blot analysis to detect expression of A 1 . I* 
TCR-VJOr/cpVllI fusion proteins in E. colu Lysates from 
recombinant TCR*-pComb8 (lane I: A 1.1 -TCR VJ; lane 2: 
Al.l-TCR-VJQ or parental pComb8 (lane 3: vector only) 
phagemid transformed XL I -Blue cells were subjected to SDS- 
PAGE and immunoMotted with an anti-TCR Cat antibody 
(H28.710.16). The position of the band migrating at the pre- 
dicted molecular weight of the TCR-VJGx/cpVIII fusion pro- 
tein (M, 36.5 kD) is indicated by the arrow. (B) Display of 
TCR-VJCa polypeptides on the surface of recombinant phage. 
Binding of recombinant phage to anti-TCR antibody was uti- 
lized to detect the presence of TCR on the phage surface. 
Recombinant TCRot-pComb8 phage or parental pComb8 
phage (indicated in the figure) were panned on plates coated 
with antibodies specific for TCR Ca (H28.710. 16, solid bars) or 
TCR V06 (H57-597, hatched bars) The number of bound 
phage was assessed by counting the number of efuted ampiciUin 
resistant CFU following extensive washing as described in the 
experimental procedures. 



The presence of TCRa molecules displayed on the sur- 
face of recombinant phage was assessed using anti-TCR 
antibodies. Recombinant phage were "panned" on plas- 
tic dishes (Pannley and Smith, 1988) coated with either 
anti-TCR Ca (H28-7 10.16) or anti-TCR Cp (H57-597) 
antibodies (Fig. 2B). Following extensive washing, the 
bound phage were eluted with a low pH buffer. Sub- 
sequent infection of host cells with eluted phage was 
readily achievable as the cpIII proteins, utilized for 
adsorption to the F' episome and infection of E. colU 
remain intact following acid elution. Since the pComb8 
phagemid vector contains an amtf gene, the number of 
bound phage could be assessed by counting the number 
of ampiciUin-resistant CFU following infection of host 
cells with eluted phage. In addition to recombinant phage 
displaying A 1 . 1 TCRa chains, we also constructed phage 
displaying the TCRa chains from the 5C.C7 T cell hybrid- 
oma (specific for pigeon cytochrome C peptide restricted 



to I-E k ) (Fink et a!., 1986). As shown in Fig. 2B, both 
the A 1.1 TCR-VJCa as well as the 5C.C7-TCR-VJCa 
recombinant phage were readily bound by the anti-TCRa 
but not the anti-TCR^ antibodies. As predicted, the par- 
ental pComb8 phage (vector only) and recombinant 
TCR-VJa phage showed no binding to cither antibody 
as these phage lack the TCR Ca determinant recognized 
by H28.710.16 antibody. Thus, polypeptides bearing 
TCR Ca specific determinants were effectively displayed 
on the surface of recombinant phage particles, indicating 
that TCRa/cpVIII fusion proteins could be effectively 
incorporated into the capsid coat during phage assembly. 

Specificity of TCR displayed on. recombinant phage 

The A 1 . 1 T hybridoma cells recognize a synthetic poly- 
peptide with the sequence poly [EYK(EYA) 3 J (poly 18) 
presented by 1-A d (Fotedar et a/., 1985). The antigenic 
fine specificity of A 1.1 cells for a series of poly 1 8 -related 
peptide analogues was assessed by measuring the IL-2 
responses to peptide pulsed BALB/c (I-A d ) spleen cells. 
As shown in Fig. 3 A, A 1.1 cells produced cytokine in 
response to two of the peptides, EYK(EYA) 4 and EYK- 
(EYA) 4 EYK. Peptide analogues, substituted with ala- 
nines at residues 3 or 10, failed to stimulate A 1.1 cells. 
These results are in agreement with those reported by 
others for this cell line (Fotedar et aJ. f 1985). The anti- 
genic fine specificity of the A 1.1 hybridoma cells for the 
four peptides described above provided a model with 
which to compare specificity of TCRa molecules dis- 
played on recombinant phage. 

The poly 18-related peptide analogues utilized to 
characterize the specificity of A 1.1 hybridomas were 
coated onto 96-well plates for panning experiments to 
assess the binding capacity and specificity of the TCRa 
molecules displayed on the phage surfaces. Bound phage 
were extensively washed, eluted and then quantitated by 
counting the number of ampicillin-resistant CFUs fol- 
lowing infection of host cells. Representative of a series of 
10 binding experiments, Fig. 3B shows that recombinant 
phage expressing AI.l TCR-VJCa preferentially bound 
to the antigenic peptides, EYKEYAEYAEYAEYAEYK 
and EYKEYAEYAEYAEYA, but not to non-antigenic 
peptides with an alanine substitution at residues 3 (EYA- 
EYAEYAEYAEYA) or 10 (EYKEYAEYAEYAEYA). 
The parental pComb8 phage (vector only) and recom- 
binant phage displaying 5C.C7-TCR-VJCa showed no 
specific binding to any of the four poly 18 peptide ana- 
logues (Figs 3B and 4A). Recombinant phages displaying 
A 1.1 TCR VJa, which lack the Ca-domain, also showed 
preferential binding to the antigenic peptides, indicating 
that the specific binding was mediated by the TCR Va- 
and Ja-region domains (Fig. 4B). The observed pref- 
erential binding to antigenic peptide was reproducible 
and consistent over a wide range of phage titres when 
we collected the quantitative data from seven different 
experiments utilizing several different recombinant phage 
preparations from two different recombinant constructs 
(Fig. 4C). Thus, direct binding by phage displaying A 1.1 
TCRa to peptides reflects the antigenic specificity of the 
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Fig. 3. Antigenic fine specificity of A 1. 1 hybridoma cells and 
A 1. 1 -TCRa Recombinant phage. (A) Fine specificity of A 1.1 
hybridoma cells. The ability of poly 18 related peptide ana- 
logues to induce an IL-2 response was utilized to assess the fine 
specificity of Al . 1 hybridoma cells. The BALB/c spleen cells (1- 
A*) were pulsed with the indicated poly 18 related peptide 
analogues and co-cultured with A 1.1 hybridoma cells as 
described in the experimental procedures. The relative amount 
of IL-2 produced in response to the peptide antigens is indicated 
in the Y-%x\s as the amount of tritiated thymidine uptake in a 
CTLL assay. (B) Comparative fine specificity of peptide binding 
by recombinant Al.l-TCR phage. The binding capacity of 
recombinant phage displaying Al.l-TCR-VJCz was assessed 
using the same four poly 18 related peptide analogues used to 
assess the fine specificity of the A 1.1 hybridoma cells. Plates 
were coated with the indicated peptide analogues followed by 
panning with recombinant phage displaying AI.l TCR-VJCa. 
The number of bound phage following extensive washing was 
assessed by determining the number of eluted ampicillin resist- 
ant CFU. 



A 1.1 hybridoma from which the TCRa cDNA sequence 
was derived (see Fig. 3A), despite the absence of class II 
MHC presentation. 

Affinity selection of specific TCRa 

To assess the capacity for affinity selection of phage 
displaying specific TCR polypeptides, recombinant 
phage displaying A LI TCR-VJCa were mixed 1:100 or 
1:1000 with phage displaying 5C.C7-TCR-VJCa and 
then subjected to two rounds of affinity enrichment on 
plates coated with antigenic poly 18 peptide analogues. 
The eluted phage were used to infect host cells and col- 
onies from ampicillin plates were lifted onto nitro- 
cellulose filters. The colony-lifts were lysed and 
hybridized with a DNA probe specific for the A 1. 1 TCRa 



sequence. We found that 85% (468/554) of the colonies 
from the 1:100 mix and 76V. (113/149) of the colonies 
from the 1:1000 mix contained ALI TCRa specific 
sequences (Fig. 5). Thus, two rounds of selection enriched 
the phage displaying Al .1 TCRa by over 500- and 3000- 
fold, respectively. These results suggest that the phage - 
display system can be utilized to select for T cell receptors 
capable of binding directly to peptide h'gands and has 
potential to be utilized as a gene identifying system for 
TCRoe chains with relatively high affinity for their peptide 
Hgand. 

The TCRa binding directly to protein antigen 

While phage displaying 5C.C7-TCR-VJCa failed to 
bind to the poly 18 peptide analogues bound by A 1.1 
TCRx recombinant phage, these phage also failed to bind 
antigenic cytochrome C peptide, in repeated attempts 
(data not shown). Thus, differences in detectable binding 
of TCR Va to specific antigenic peptides may represent 
variance in the particular TCR-iigand interactions such 
that only a subset of TCR, including that on A 1.1 ceils, 
have direct interactions strong enough to be detectable 
in this system. Accordingly, we constructed recombinant 
phage displaying TCRa from the 3B3 T cell hybridoma, 
which has previously been suggested to express a TCRa 
chain product capable of direct interaction with antigen 
(Mori et al, 1993). The 3B3 hybridoma was generated 
from BALB/c mice immunized with bee venom phos- 
pholipase A 2 (PLAJ and is specific for a bee venom 
PL A 2 derived peptide (residues 19-34) in the context of 
!-A d (Mori et a/., 1993). Two aspects of 3B3 TCR binding 
characteristics made it particularly interesting for phage 
display studies of direct antigen-binding TCRa chains. 
Firstly, three-dimensional structural analysis of bee 
venom PLA 2 indicated that the antigenic peptide residues 
associated with 3B3 TCR binding are exposed to the 
outer surface of the PLA 2 molecule in the form of a 
flexible loop (Scott et a/ M 1990). Secondly, previous pep- 
tide competition studies of 3B3 TCR specificity suggested 
that the TCRa chain may be able to bind directly to 
unprocessed bee venom PLA 2 (Mori et a/., 1993). There- 
fore, we constructed recombinant phage displaying 3B3- 
TCR-VJCa to test the ability of this TCRa chain to 
directly bind to unprocessed bee venom PLA 2 protein in 
a specific manner. 

The PCR-generated fragments encoding the 3B3- 
TCR-VJCa polypeptide were subcloned into the 
pComb8 vector for production of recombinant phage. 
Bee venom PLA 2 was coated onto 96-well plates for pan- 
ning experiments to assess the binding capacity and speci- 
ficity of the 3B3 TCRa chains displayed on the phage 
surface. We observed binding of recombinant 3B3-TCR- 
VJCa phage, but not parental pComb8 phage (vector 
only), to bee venom PLA 2 coated wells (Fig. 6A). Fur- 
thermore, binding of recombinant 3B3-TCR-VJCa 
phage to BSA (3% BSA, used for blocking) was not 
observed. Moreover, the recombinant phage displaying 
3B3 TCR-VJCa did not bind to bovine PLA 3 (Fig. 6B), 
which lacks the exposed hydrophilic loop containing the 
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FigM. Antigenic binding is mediated by the VJ regions of specific TCR-recombinant phage. (A) The 
bma^g capacity of parental pCombS phage or recombinant phage displaying Al.l-TCR-VJOt or 
5C.C7-TCR-VJC« for the indited poly 18 related peptide analogues was assessed using the panning 
pro ocol described in the experimental procedures. Plates were coated with the indicated peptide 
analogues and panned with the indicated phage. The number of bound phage following extensive 
washing was assessed by determining the number of ampicillin resistant CFU//il. (B) The binding 
capacity of recombinant phage displaying TCR VJ* (truncated C-region) from the AI.I or 5C C7 
hybndoma cells to the four indicated poly 18 related peptide analogues was determined as described 
Z l ° ZFL* , " ' u C ^ , - re 8 ions wer * sufficie »« to mediate the specific binding to antigenic 
TTR vir xVd ? f oTsevenl «combio« t phage preparations, displaying 

TCR-VJCa or TCR VJ a . for the indicated poly 18 related peptide analogues are compared from 
seven different panning experiments as described above. The experiments described in (A) and (B) 

are included for comparison. 



antigenic peptide sequence associated with 3B3 TCR 
binding (Dijkstra et a/., 1 98 1; Mori ct al, y 1993). The 
observed preferential binding to antigenic bee venom 
PLA 2 was reproducible and consistent over a wide range 
of phage titres when we collected the quantitative data 
from five different experiments utilizing several different 
recombinant phage preparations (Fig. 6C). Thus, the 
phage display system enabled us to assess the novel 
capacity of a TCR* chain to bind directly to antigenic 
protein in the absence of MHC. 



In order to further assess the specificity of the TCRa 
recombinant phages for their respective antigenic ligands, 
we did parallel panning experiments with the At. 1 TCR- 
VJCa and 3B3 TCR-VJGx recombinant phage (Fig. 7). 
Phage displaying AI.I TCR-VJCa showed binding to 
the antigenic poly 18 peptide analogue (EYK- 
EYAEYAEYAEYAEYK) but not to bee venom PLA 2 . 
In contrast, phage displaying 3B3 TCR-VJCa dem- 
onstrated specific binding to bee venom PLA 2 but not to 
the poly 18 peptide analogue. Thus, the TCR phage- 
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Fig. 5. Affinity selection of specific TCR on recombinant phage. 
A mixing experiment was done to assess the capacity for affinity 
selection of peptide specific TCR displayed on the surface of 
recombinant All -TCRa-pComb8 phage. The Al.l-TCR- 
VJC* phages were mixed 1:100 or 1:1000 with 5C.C7-TCR- 
VJCot phage and subjected to two rounds of affinity selection 
by panning as described in the experimental procedures. Colony 
lifts were hybridized with a radiolabeled Al.l-TCRa specific 
oligonucleotide probe. Autoradiographs of the hybridized rep- 
lica filters are shown in panels A-D. Autoradiographs of control 
colony lifts from plates inoculated with unmixed 5C.C7 TCRae 
recombinant phage (panel A) or unmixed Al.l-TCRa recom- 
binant phage (panel B) are shown for comparison with plates 
from mixing experiments starting with Al.lot/5C.C7a ratios of 
1:100 (panel Q or 1:1000 (panel D) prior to the two rounds of 
affinity selection. 



display system described here facilitated the study of 
TCRot chains capable of binding directly to antigenic 
epitopes in a specific manner. 

DISCUSSION 

The phage-display system (Smith, 1985) has proved to 
be extremely useful for the study of specific receptor- 
ligand interactions (Winter et a/., 1994). The advantage 
of this system for immunological studies is that the dis- 
play of receptors on recombinant phage provides a means 
to link directly antigen recognition structures and the 
genetic instructions encoding the receptor (Kang et a/., 
1991). Consequently, the phage display system has been 
invaluable for the characterization and selection of anti- 
body specificities for antigenic ligands. Here, we have 
shown that this system can be utilized further to study 
specific binding interactions of constituent TCR poly- 
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Fig. 6. Antigenic specificity of recombinant phage displaying 
TCRat polypeptide from 3B3 hybridoma cells. (A) The binding 
capacity of parental pComb8 phage or recombinant phage dis- 
playing TCR-VJCa from the 3B3 hybridoma cell lo bee venom 
PLA 2 was assessed using the panning protocol described in the 
experimental procedures. Plates were coated with the indicated 
concentration of bee venom PLA 2 and then panned with 10* 
CFU of recombinant phage displaying 3B3-TCR-VJCa or par- 
ental pComb8 phage. The number of bound phage following 
extensive washing was assessed by determining^ the number of 
ampicillin resistant CFU//il. (B) The specificity of recombinant 
phage displaying TCR-VJGx from the 3B3 hybridoma cell for 
bee venom PLA 2 and bovine PLA 2 wa * assessed using the pan- 
ning protocol. The number of bound phage following panning 
with 10'° CFU of recombinant phage and extensive washing 
was ancssed by determining the number of ampicillin resistant 
CFU/jd. (Q The relative binding specificity of several recom- 
binant phage preparations, displaying 3B3-TCR-VJCa for bee 
venom PLA 2 or bovine PLA 2 are compared from five different 
panning experiments as described above. 
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peptides to antigenic ligands. Specifically, the phagc-dis- 
play system facilitated a novel demonstration of direct 
binding by TCRot chains to peptide or protein antigens 
in a specific manner. These results indicate that for some 
TCRa chains the binding affinity to antigenic peptide 
may be sufficiently high that it can be detected in the 
absence of the TCR0 chain, and in an MHC-independcnt 
manner. It has been postulated previously that TCR 
binding to peptide by Va- and V/f-subunits may be rela- 
tively autonomous and that the resulting TCR-peptide 
interactions can dramatically influence TCR-MHC inter- 
actions; indicating the "primacy** of TCR-peptide inter- 
actions (Ehrich et a/., 1993). Our results extend these 
findings by demonstrating that the dominant interactions 
of certain TCRa chains for peptide antigens may be 
sufficiently high that they can be analysed independently. 
However, these interactions are quite unusual in that they 
do not require the expression of the second TCR subunit 
or normal MHC and coreceptor interactions. These 
results raise the concern that this model does not reflect 
typical TCR-ligand interactions. Indeed; gene transfer 
studies of A 1.1 TCRa chain sequences indicated that the 
expression of the a chain alone was not sufficient to confer 
responsiveness to peptide antigen alone or in the context 
of I-A d (H. Zheng, A. Fotedar and D.R. Green; unpub- 
lished data). Furthermore, activation of A 1.1 hybridoma 
cells required specific peptide antigen to be presented in 
the context of the appropriate MHC (I-A d ) (Fotedar et 
a/., 1985). The requirement for MHC presentation and 
natural TCRa/f conformation for functional interaction 
of cell surface expressed receptors may be essential for 
bringing the respective binding sites into close enough 
proximity to establish the specific binding interaction 
between the residues of opposite charge which mediate 
binding. The specificity data presented here (Fig. 3 and 
Fig. 4 and Fig. 7) suggest that these requirements may 
not be necessary for some receptors to detect specific 



binding in non-cellular TCR phage-display systems. 
Therefore, the data presented here do not suggest that 
the TCRa chain alone would be sufficient for T cell rec- 
ognition and activation, but rather that the TCR phage- 
display system provides a means to explore the specific 
binding interactions that mediate TCR binding to antigen 
at a constituent level. While this study does not refute 
commonly held theories of TCR-antigen interactions, 
this system does provide a means to explore the innate 
specific binding interactions contributed by the con- 
stituent subunits of TCR complexes for the peptide com- 
ponent of MHC-peptide ligands. In the absence of 
detailed structural data for the TCR/pepude/MHC tri- 
molecular complex, models such as this can provide 
important insight into the nature of the elemental inter- 
actions that constitute TCR binding to MHC-peptide 
complexes. Furthermore, such information should 
facilitate interpretation of the structural data for the 
trimolecular complexes when this becomes avail- 
able. 

Previous studies indicate that TCR-peptide binding 
interactions are very specific, since single amino acid 
changes had profound effects on binding (Engel and Hed- 
rick, 1988; Danska et a/., 1990). In accordance with these 
results, the direct binding of phage-displayed TCRa 
chains to antigen appeared to be specific. Recombinant 
phage-displaying A 1.1 TCRa chains bound to antigenic 
peptide analogues but not to mutated peptide analogues 
that failed to activate A 1.1 hybridoma cells (Fig. 3 and 
Fig. 4). Additionally, the cross-specificity experiment 
indicated that recombinant phage-displaying A 1 . 1 TCR- 
VJCa or 3B3-TCR-VJCa bound their respective antigens 
but did not cross-react (Fig. 7). 

The failure of repeated attempts to demonstrate spec- 
ific binding of recombinant phage-displaying 5C.C7 
TCRa to cytochrome C peptide indicated that only a 
subset of TCR Va have the capacity for direct interactions 
with antigen strong enough to be detectable in this 
system. Previous studies of 5C.C7 TCR interactions with 
peptide antigen indicate that specific binding was 
mediated by CDR3 residues from both the a and 0 chains 
(Jorgensen et a/., 1992a, 1992b). Thus, binding to antigen 
by receptors such as 5C.C7 TCR may require the com- 
bined binding interactions of the a and chains to 
mediate strong specific binding, while antigen binding by 
receptors such as A 1.1 and 3B3 TCR may be dominated 
by specificities mediated predominandy by the TCRa 
chain. 

The phage-display system described here extends the 
study of T cell recognition by providing a means to sim- 
plify the study of the specific interactions that constitute 
the overall receptor-ligand interactions. The power of 
the phage-display system is that recombinant phage can 
be amplified and subjected to multiple rounds of affinity 
selection as they contain the genetic instructions for pro- 
duction of the receptor molecule. A system for selection 
and cloning of antigen-binding TCR molecules will be 
useful in exploring TCR binding to various antigens and 
should provide a means to more closely examine TCR- 
ligand interactions. 
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ABSTRACT T-cell receptors (TCRs) are membrane an- 
chored heterodimers structurally related to antibody mole- 
cules. Single-chain antibodies can be engineered by linking the 
two variable domains, which fold properly by themselves. 
However, proper assembly of the variable domains of a human 
TCR (V« and V p ) that recognize the HLA-DR2b/ myelin basic 
protein-(85-99) peptide complex was critically dependent on 
the addition of a third domain, the constant region of the TCR 
ft chain (Cp), to the single-chain construct. Single-chain mol- 
ecules with the three-domain design, but not those with the 
two-domain design, expressed in a eukaryotic ceH as chimeric 
molecules linked either to glycosyl phosphatidylinositol or to 
the transmembrane/cytoplasmic domains of the CD3 f chain 
were recognized by a conformation-sensitive monoclonal anti- 
body. The chimeric three-domain single-chain TCR linked to 
CD3 £ chain signaled in response to both the specific HLA- 
DR/peptide and the HLA-DR/superantigen staphylococcal 
enterotoxin B complexes. Thus, by using this three-domain 
design, functional single-chain TCR molecules were expressed 
with high efficiency. The lipid-linked single-chain TCR was 
solubilized by enzymatic cleavage and purified by affinity 
chromatography. The apparent requirement of the constant 
domain for cooperative folding of the two TCR variable 
domains may reflect significant structural differences between 
TCR and antibody molecules. 



T-cell receptor (TCR) recognition of antigen fragments pre- 
sented by major histocompatibility complex (MHC) mole- 
cules is a critical step in the initiation of a specific immune 
response (1,2). The TCR a and 0 chains are each composed 
of two immunoglobulin-like domains; most of the amino acid 
residues that are found to be highly conserved in the variable 
(V) region of immunoglobulins are also found in TCR V 
regions, suggesting that the tertiary structure of the TCR may 
resemble that of immunoglobulins (3, 4). However, TCR V 
regions have significantly more primary sequence variability, 
an increased apparent rate of divergence in phylogeny, and 
peaks of variability in addition to those noted in immuno- 
globulins (5, 6). In order to understand and control the 
molecular interactions underlying T-cell recognition of 
MHC/ peptide complexes; complete structural knowledge of 
the TCR is required. 

Several approaches have been employed to produce sol- 
uble, recombinant TCRs. In these recombinant TCR mole- 
cules, the transmembrane/cytoplasmic regions of a and f} 
chains were replaced with sequences from lipid-linked pro- 
teins (7), the CD3 ( chain (8), or immunoglobulins (9, 10). 
Soluble TCRs were either recovered as secreted proteins or 
obtained by enzymatic cleavage of the surface-expressed 
recombinant proteins. All of these approaches rely, however, 
on the assembly of the heterodimer, which is inefficient (11). 
In addition, high-level expression of the human TCR a chain 
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in transfected eukaryotic cells is not stable. These problems 
can be avoided by the design of a single-chain (sc) recombi- 
nant protein in which the V regions of the heterodimer are 
joined by a short peptide linker. Such a design has been 
successfully applied to antibody molecules (12). Such recom- 
binant molecules, scFv, have a specificity and affinity similar 
to that of native antibodies (12). Several reports have de- 
scribed the production of scTCRs in bacterial expression 
systems using the sc antibody (Fv) design (13-16), but none 
have presented functional data indicating that these scTCRs 
could recognize their MHC/peptide complexes or superan- 
tigens. Recently, however, the production in bacteria of a 
scTCR with the two-domain Fv design that could recognize 
its natural ligands has been reported, although the fraction 
correctly refolded was extremely low (17). 

In the present report, different scTCR designs were eval- 
uated in transfected eukaryotic cells with respect to surface 
expression of TCR molecules, proper folding, and recogni- 
tion of the appropriate MHC/pepude ligand. A three-domain 
sc construct [a-chain V (Vji-linker-^-chain V (V^HS-chain 
constant (Cp)] was stably expressed on the cell surface when 
linked to a glycosyl phosphatidylinositol (GPI) anchor and 
recognized by a conformation-dependent monoclonal anti- 
body (mAb) specific for the Vpll segment. The soluble form 
of this recombinant protein could be readily obtained by 
enzymatic cleavage with phosphatidylinositol-specific phos- 
pholipase C (PI-PLC). Replacement of the GPI domain with 
the cytoplasmic portion of the ( chain resulted in a functional 
TCR molecule that transduced an intracellular signal follow- 
ing recognition of either the proper MHC/peptide or the 
MHC/staphylococcal enterotoxin B (SEB) superantigen 
complex. The production of a functional scTCR directly 
demonstrates the feasibility of employing sc design to pro- 
duce soluble TCRs. 

MATERIALS AND METHODS 

Construction of Recombinant TCR Molecules. cDNAs of 
TCR a and p chains were prepared from mRNA of Hy.2H9 
cells (18) with Superscript reverse transcriptase (BRL) and an 
oligo(dT) primer (Sigma) and were amplified by PCR using 
Vent DNA polymerase (New England Biolabs) and primers 
5'-GCTCGAGGCGGCGATGGAAACTCtCCTGGGAGT-3' 
(AS) and 5'-GGAATTCAGCTGGACCACAGCCGC-3' for 
a-chain and 5'-GCTCGAGCTCTGCCATGGACTCCTG- 
GA-3' and 5 ' -GG AATTC AG AA ATCCTTTCTCTTG AC-3 ' 
for /3-chain. The cDNAs were cloned into the mammalian 
expression vector pBJ-neo (8). GPI-anchored TCR molecules 
(a-PI and 0-PI) were constructed as follows. A Ban I site was 
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engineered after the fifth amino acid residue beyond the last 
cysteine by oligonucleotide-directed mutagenesis. The region 
3' of the Ban I site was then replaced with a Ban l-Not I 
fragment encoding the GPI signal domain from the human 
placental alkaline phosphatase. For the construction of vari- 
ous scTCRs, variable domains of the 2H9 TCR a and chains 
were prepared by PGR using primers A5 and 5'-CAGAGCT- 
C ACGG ATG AAC A ATA AGGCTGGT-3 ' for the V a domain 
in all the scTCR constructs, 5'-TCGGATATCGATGGTG- 
GAATCACTCAGTCC-3' (B5) and 5 ' -C AG AG ATC AG- 
C ACGGTG AGCCGGTTCCC-3 ' for the domain in AB- 
PM, 5 '-GTGGG AG ATCTCTGCTTCTG ATGGCTC AAAC 
and B5 for the V„ domain in AB-PI-2, 5'-CACGGATC- 
CCCGTCTGCTCTACCCCAGGC and B5 for the V„ and C 0 
domains in ABC-PI, and 5'-CACGGATCCCCGTCTGCTC- 
TACCCCAGGC-3' and B5 for the and C„ domains in 
ABC-f. The cDNA encoding the transmembrane and cyto- 
plasmic domains of murine CD3 ( chain (8) was a gift of R. D. 
Klausner (National Institutes of Health). Convenient restric- 
tion sites were engineered at the end of each fragment to aid 
in the assembly of the % constmct. The linker was a 15-amino 
acid motif of GGGGS repeated three times (12) with Sac I at 
the 5' end and EcoRV at the 3' end. Except for a-PI, all the 
constructs were cloned into pBJ-neo, which carries the G418- 
resistance gene. a-PI was cloned into pCEP-4 (Invitrogen), 
which bears the hygromycin-resistance gene. All constructs 
were verified by multiple restriction digests and by sequencing 
with the Sequenase kit (United States Biochemical). 

Affinity Purification and Characterization of a Soluble 
Three-Domain scTCR. After transfectibn and G418 selection 
(8), cells expressing a high level of GPI-linked three-domain 
scTCR (ABC-PI) were isolated by three rounds of sorting. 
The resulting cells were grown in spinner culture to a density 
of 10 6 per ml and harvested by centrifugation. The pellet was 
washed twice with phosphate-buffered saline (PBS) and 
resuspended in PBScontaining 2 mM Pefabloc (CenterchenV 
Stamford, CT) to a density of 5 x 10 7 per ml with PI-PLC 
(Sigma) added at 1 unit/ml. Cells were incubated at 37°C for 

1 hr with constant rocking. The supernatant was collected by 
centrifugation and by passage through a 0.45-/tm filter and 
applied to a column of Acti-gel (Sterogen, Arcadia, CA) with 
immobilized /3F1. The column was washed with 10 volumes 
of PBS and the soluble TCR was eluted with 0.15 M glycine 
(pH 2.8). Fractions were immediately neutralized with 0.1 
volume of saturated Tris. The soluble TCR was then diaiyzed 
against > 100 volumes of PBS at 4°C with at least four changes 
and concentrated to 0.5 rag/ml by vacuum dialysis against 
PBS. Five micrograms of purified soluble three-domain 
scTCR was analyzed by SDS/PAGE under reducing condi- 
tions. 

Stimulation of Transfectants with Antibodies, SEB, and 
PepUde/MHC Ligands. ABC-f-transfected BW51470 a~p~ 
(19) cells (5 x 10 4 per well) were cultured in a 96-well 
round-bottom plate to which various antibodies had been 
immobilized (1 y% per well). The supernatants were collected 
after 24 hr and interleukin 2 (IL-2) production was assessed 
in a bioassay using an IL-2-dependent cell line (CTLL) and 
the CellTiter-96 nonradioactive proliferation assay 
(Promega). In the case of ABC-f-transfected RBL-2H3 (8) 
cells, the cells were incubated with [ 3 H]serotonin (NEN) at 
0.5 /tCi (18.5 kBq) for 24 hr before they were added to the 
antibody plate. After incubation at 37°C for 2 hr, radioactivity 
released into the supernatant was measured in a liquid 
scintillation counter. The specific serotonin release was 
calculated as described (8), For SEB stimulation, 5 x 10 4 
transfected cells per well were cultured with various con- 
centrations of SEB (Toxin Technology, Sarasota, FL) in the 
presence or absence of 2 x 10 5 B cells. For antigen presen- 
tation, 5 x 10 4 transfected cells per well were cocultured with 

2 x 10 5 B cells which were incubated with or without the 



myelin basic protein (MBP)-(85-99) peptide for 3 hr before 
the experiment. The assays were conductedas described 
above. 

RESULTS 

mAb CI Recognizes a Conformational Epitope of TCR. 
Recombinant TCR molecules were generated by using the 
TCR a- and 0-chain sequences of the human MBP-specific 
T-cell clone Hy.2H9 (18). This clone TCR is composed of the 
Vo3.1 and Vpl7.1 segments and is specific for the immuno- 
dominant MBP peptide MBP-(85-99) in the context of HLA- 
DR2 (DRA, DRB1*1602) (18). Usage of the V p 17.1 segment 
allowed the proper folding of recombinant TCRs to be probed 
with the superantigen SEB (20) and the mAb CI (21). To 
confirm the Vpl7 specificity of CI, the extracellular domains 
of TCR a and p chains of Hy.2H9 cells were fused to the 
C-terminal sequence from human placental alkaline phospha- 
tase for GPI anchorage (Fig. 1) and the DNAs encoding the 
GPI-anchored and a chains (£-PI and a-PI) were sequen- 
tially transfected by electroporation (8) into a TCR a- and 
0-chain-deficient murine lymphoma cell line, BW5147 a~f$~ 
(BW~) (19). The surface expression of the GPI-anchored 
TCR chains was monitored by staining with mAbs oeFl (22), 
0F1 (23), and CI. aFl and 0F1 recognize nonconformationai 
epitopes located in the C region of the TCR or and & chains, 
respectively. Surface expression of the GPI-anchored TCR f$ 
chain is independent of heterodimer formation and assembly 
of the CD3 complex (7). In the /3-PI-transfected cells (Fig. 2, 
open curves), high-level expression of /3-PI was confirmed by 
staining with 0F1. Interestingly, there was little CI staining 
of these transfectants. However, when a GPI-anchored 2H9 
a chain was supertransfected into the /3-PI transfectant (Fig. 
2, shaded curves), CI reactivity was greatly increased while 
the level of 0F1 staining remained constant. Thus, the 
V^17-specific CI epitope is conformational and dependent on 
the proper pairing of TCR a and p chains and can therefore 
be used to assess the proper folding of recombinant TCRs 
bearing a Vpl7 sequence. 
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Fio. 1. Schematic representation of TCR of- and £-chain genes 
and various chimeric constructs. S — S, disulfide bond; L, leader; Y, 
variable segment; J, joining segment; C, constant region; TM, 
transmembrane region; Cy, cytoplasmic region; ATG, start codon; 
Li t 15-residue peptide linker containing three repeats of GGGGS; PI, 
GPI domain of human placental alkaline phosphatase with the 
sequence LAPPAGTTDAAHPGRSWPALLPLLAGTLLLL (7). 
The £ region contains transmembrane and cytoplasmic domains of 
the murine CD3 ( chain starting at position 31 (8). 
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Fig. 2. Cocxpression of a-PI and 0-PI are required for CI 
reactivity. Flow cytomeric analysis of 0-PI-transfected BW~ cells 
stained with mAb aFl, /3F1, or CI before (open curves) and after 
(shaded curves) the supertransfection of the a-PI construct. 

High-level expression of the TCR a chain (a-PI) was, 
however, not stable either alone or in the presence of 0-PL 
Attempts were made on several cell lines, including COS-7, 
CHO-K1, and a TCR-deficient variant of Jurkat cells, JK-jT 
(J.RT3-T3.5, American Type Culture Collection). The ex- 
pression level of a-PI was comparable to that of 0-PI after the 
initial drug selection, but continued culture for less than a 
month yielded a population of cells with little surface ex- 
pression of a-PI, whereas )3-PI expression was stable (data 
not shown). The inability to obtain cell lines with stable 
high-level expression of the Pl-anchored human TCR o-chain 
has been reported by other laboratories as well (24). 

Expression and Purification of a Three-Domain scTCR. To 
overcome the limitations set by the unstable expression of the 
human TCR a chain, various sc designs were examined. 
Initially, a design similar to that of sc antibodies (Fv) was 
chosen (12). A 15-residue flexible linker was used to link the 
C terminus of the V^-Ja domain to the N terminus of the /3 
chain. The GPI domain was then ligated to the C terminus of 
the V^-Jp domain. The construct (AB-PI-1, Fig. 1) was 
transfected into several cell lines, including JK-/T, COS-7, 
CHO-K1, and BW~. Although the expression of the gene was 
confirmed by the detection of the correct RNA transcripts 
(Fig. 32?), no surface expression was detected, as evidenced 
by negative CI antibody staining (Fig. 3A). Immunoprecip- 
itation after metabolic labeling failed to recover any Cl- 
reactive sc molecules from these transfectants (data not 
shown). The inability to identify any Cl-reactive protein 
could have been due to the design of this molecule, such as 
insufficient linker length between the extracellular domain 
and the GPI domain. To improve the accessibility of the sc 
construct, another two-domain scTCR was designed in which 
an extra 30-amino acid portion of the N terminus of the C$ 
domain was added as a hinge region. The transfectants of this 
construct (AB-PI-2, Fig. 1) were still not reactive with the CI 
antibody (data not shown). Finally, the entire C^ domain was 
added to the sc construct. A complete C$ domain should 
provide enough distance for the V^V^ domains to be ex- 
pressed on the cell surface and, more importantly, should 
allow surface expression to be monitored with another anti- 
body, )3F1 (23). This three-domain scTCR was constructed 
by extending the TCR 0-chain sequences to the residue right 
before the last cysteine (the sixth cysteine), which was then 
fused to the GPI domain. The last cysteine was deleted to 
prevent dimerization between C^ domains. This construct 
(ABC-PI, Fig. 1) was transfected into BW" cells and surface 
expression was confirmed by staining with both jSFl and CI 
(Fig. 3C, shaded curves). Both antibodies stained the cells 
with comparable efficiency, suggesting that most of the 
molecules were expressed in the correct conformation. 
Moreover, the molecule could be efficiently cleaved from the 
cell surface with PI-PLC (Fig. 3C, open curves). 

Soluble three-domain scTCR was purified from transfec- 
tants after PI-PLC cleavage followed by affinity chromatog- 
raphy using the 0F1 antibody. The purified three-domain 
scTCR appeared as multiple bands at 50-70 kDa after SDS/ 
PAGE (Fig. 3D). The heterogeneity of scTCR is probably the 
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Fio. 3. Flow cytomeric analysis of surface expression of scTCR 
constructs. (A) Lack of CI reactivity in BW" cells transfected with 
two-domain scTCR (AB-PM). (B) RNA analysis of poly(A)-enriched 
total cell RNA isolated from BW" cells transfected with two-domain 
and three-domain scTCR constructs AB-PI-1 and ABC-PI, respec- 
tively. Samples were analyzed in a nuclease SI protection assay (25) 
using probes specific for the. 5' end of the transcripts from the TCR 
constructs. The coexpressed heo (G418-resistance gene) transcripts 
from the vector were analyzed with a probe at the same time as a 
control. (C) Flow cytometric analysis of ABC-PI-transfected BW" 
cells with both 0F1 and CI antibodies before (shaded curves) and 
after (open curves) PI-PLC treatment. (D) SDS/PAGE of affinity- 
purified three-domain scTCR. (E) Comparison of CI reactivity of 
three-domain scTCRs produced from eukaryotic (Euk) and prokary- 
otic (Prok) expression systems in a two-antibody ELISA. A plateau 
is reached because the amount of £F1 attached to the plate became 
limiting. 

result of variable glycosylation; its polypeptide size calcu- 
lated from amino acid composition is 40 kDa. The structural 
integrity of the three-domain scTCR was verified by a two- 
antibody ELISA (Fig. 3£). The molecules were first captured 
by the jSFl antibody immobilized to the plate and then 
assessed for reactivity with the CI antibody. When compared 
with the three-domain scTCR produced in a bacterial expres- 
sion system (unpublished work), the scTCR from the eukary- 
otic system gave 10-20 times higher CI reactivity. The 
purified three-domain scTCR was stable and could be stored 
in PBS at 4°C for months without significant loss of CI 
reactivity. 

Functional Characterization of a Chimeric Three-Domain 
scTCR. To directly assess the functional integrity of the 
three-domain scTCR, a self-signaling scTCR was produced 
by replacing: the GPI domain with the transmembrane and 
cytoplasmic domains of the CD3 £ chain. These regions have 
been shown to be sufficient for signal transduction when its 



Immunology; Chung et at. 



Froc. Natl. Acad, Set. USA 91 (1994) 1263/ 



extracellular fusion partner is crosslinked by an antibody or 
by the proper ligand (8, 26, 27). To enable the recovery of 
three-domain scTCR as a soluble form, a linker containing a 
thrombin cleavage site was inserted into the junction of 
three-domain scTCR and the I domain. The construct 
(ABC-0 was transfected into BW" cells (28) and the rat 
basophilic leukemia cell line RBL-2H3 (RBL) (8), and the 
populations displaying high-level expression of three-domain 
scTCR were isolated by three rounds of cytofluorometric 
sorting using the antibody 0F1. The ABC- f- transfected cells 
were first stimulated with various antibodies to confirm the 
self-signaling nature of this recombinant molecule. The signal 
transduced upon the activation of the three-domain scTCR 
was measured as IL-2 production in BW" transfectants, 
whereas serotonin release was measured in RBL transfec- 
tants. Both transfectants showed a strong response following 
pFl and CI stimulation but not to purified mouse immuno- 
globulin or anti-CD8 antibody used as controls (Fig. 4 A and 
B). The structural integrity of the scTCR was further exam- 
ined with the superantigen SEB, which binds to both V>17 



and MHC class II molecules, resulting in TCR crosslinking 
and T-cell activation regardless of the peptide bound to the 
MHC molecule (20, 29). ABC-£ transfectants displayed a 
concentration-dependent response toward SEB (Fig. 4C) 
when the superantigen was presented by transformed B-cell 
lines with high-level expression of DR1 (DRA, DRB1*0101; 
cell line LG2) or DR2 (DRA, DRB1*1602; cell line 9016). 
Thus, the lateral face of the TCR V p region to which SEB is 
thought to bind (30) is structurally intact. 

To prove that the three-domain scTCR did indeed recog- 
nize the MHC/peptide ligand, antigen presentation experi- 
ments using the natural ligand for the Hy.2H9 clone, 9016 
cells bearing the DRB1*1602 allele of DR2, and MBP-(85-99) 
peptide were performed. To ensure detection of subtle ab- 
normalities in the structure of the three-domain scTCR, 9009 
cells (DRA, DRB 1*1601), which also bind MBP-(85-99), 
were used as a control. DRB1*1601 and DRB1*1602 differ 
only at position 67 in the DR^l domain; this TCR contact- 
residue substitution does, however, abolish recognition of 
the peptide by the parent T-cell clone (ref. 18; K.W.W., 
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unpublished work). The ABC-£-transfected BW" cells se- 
creted IL-2 in response to peptide-pulsed 9016 cells, but not 
to peptide-pulsed 9009 cells (Fig. 4I>). Similar results were 
obtained with RBL transfectants (Fig. 4E) t as serotonin 
release was dependent on the concentration of the MBP 
peptide used to pulse 9016 cells. The signal appeared to be 
weak when compared with antibody stimulation. This is not 
surprising, however, since saturating amounts of antibodies 
are expected to crosslink the majority of TCR molecules on 
the target cell, whereas a much smaller fraction of TCR 
molecules is probably engaged when T cells are cocultured 
with peptide-pulsed antigen-presenting cells, in which a max- 
imum of 5-15% of the DR molecules bind the peptide. The 
requirement for a high concentration of peptide or SEB is not 
due to the sc design, since high concentrations of peptide are 
also needed to stimulate af$ heterodimers of the TCR-f 
constructs (8). It is likely that the decrease in sensitivity 
results from the lack of CD3, CD4, and/or other adhesion/ 
signaling molecules. Nonetheless, these results demonstrate 
that the three-domain scTCR was correctly folded and func- 
tionally competent. In addition, a soluble form of three- 
domain scTCR could be obtained from the ABC-f transfec- 
tants by thrombin cleavage and affinity purification (data not 
shown). 

DISCUSSION 

A scTCR molecule was designed which contains the V 
domains of both a and 0 chains and the C domain of the 
chain. This scTCR molecule could be stably expressed at a 
high level in eukaryotic cells and could be isolated in a soluble 
form by enzymatic cleavage and affinity chromatography. 
The V rt and Vp domains appeared to be properly paired, since 
the scTCR bound to a conformation-dependent mAb, the 
superantigen SEB, and the proper MHC/peptide ligand. This 
design of scTCR offers an alternative to the two-chain design 
of soluble TCRs and has several advantages, (i) The sc design 
avoids the low-efficiency dimerization process which may be 
the limiting step in the assembly of TCR heterodimers from 
a and 0 subunits made in Escherichia colt. The sc design 
therefore allows efficient expression of the recombinant 
protein in quantities suitable for structural analysis and for 
some diagnostic or therapeutic applications. (it) The design 
avoids the problems associated with the unstable expression 
of the human TCR a chain that have hindered efficient 
expression of human TCR molecules in eukaryotic cells. (Hi) 
The sc design may allow the construction of TCR phage 
display libraries similar to those made for sc antibodies (28, 
31). scTCR phage libraries may be powerful tools for the 
isolation of TCRs with defined specificities and/or high 
affinity for selective targeting of malignant and virally in- 
fected cells and for analyzing the interactions among TCR, 
MHC/peptide complexes, and superantigens. 

Unlike antibodies, separately expressed V domains of TCR 
a and j3 chains have not been reconstituted to form het- 
erodimers (ref. 32; K. L. Hilyard, personal communication). 
However, despite this success in producing the three-domain 
scTCR, a two-domain scTCR with detectable CI reactivity 
could not be produced either in eukaryotic cells or in bacte- 
ria. The presence of the Cp domain (or part of the domain) 
may be required for the proper folding and/or stabilization of 
a scTCR molecule. In any event, the three-domain design 
provides a general means for the efficient production of 
functional scTCR molecules. 
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HETEROBEVIERIC RECEPTOR LIBRARIES 
USING PHAGEMIDS 

CROSS-REFERENCE TO RELATED 
APPLICATION 

This application is a continuation of U.S. patent applica- 
tion Ser. No. 07/826,623. filed on Jan. 27. 1992. now 
abandoned, which is a continuation-in-part of U.S. patent 
application Ser. No. 07/683.602. filed Apr. 10. 1991. now 
abandoned, the disclosures of which are hereby incorporated 
by reference. 

This invention was made with government support under 
Grant No. CA 27489 awarded by the National Institutes of 
Health. 

TECHNICAL FIELD 

The present invention relates to cloning vectors and 
methods for producing a library of DNA molecules capable 
of expressing a fusion protein on the surface of a filamentous 
phage particle. 

BACKGROUND 

Filamentous bacteriophages are a group of related viruses 
that infect bacteria. They are termed filamentous because 
they are long and thin particles comprised of an elongated 
capsule that envelopes the deoxyribonucleic acid (DNA) 
that forms the bacteriophage genome. The F ptii filamentous 
bacteriophage (Ff phage) infect only gram-negative bacteria 
by specifically adsorbing to the tip of F pili, and include fd, 
fl and M 13. 

The mature capsule of Ff phage is comprised of a coat of 
five phage-encoded gene products: cpVffl. the major coat 
protein product of gene VTH that forms the bulk of the 
capsule; and four minor coat proteins, cpm and cpIV at one 
end of the capsule and cpVII and cpDC at the other end of the 
capsule. The length of the capsule is formed by 2500 to 3000 
copies of cpvm in an ordered helix array that forms the 
characteristic filament structure. About five copies each of 
the minor coat proteins are present at the ends of the capsule. 
The gene Hi-encoded protein (cpHI) is typically present in 
4 to 6 copies at one end of the capsule and serves as the 
receptor for binding of the phage to its bacterial host in the 
initial phase of infection. For detailed reviews of Ff phage 
structure, see Rasched et al.. Microbiol Rev^ 50:401-427 
(1986); and Model et aL. in 'The Bacteriophages, Volume 
2", R. Calendar, Ed., Plenum Press, pp. 375-456 (1988). 

The assembly of a Ff phage particle involves highly 
complex mechanics. No phage particles are assembled 
within a host cell; rather, they are assembled during extru- 
sion of the viral genome through the host cell's membrane. 
Prior to extrusion, the major coat protein cpVUJ and the 
minor coat protein cpm are synthesized and transported to 
the host cell's membrane. Both cpVm and cpm are 
anchored in the host cell membrane prior to their incorpo- 
ration into the mature particle. In addition, the viral genome 
is produced and coated with cpV protein. During the extru- 
sion process, cpV- coated genomic DNA is stripped of the 
cpV coat and simultaneously recoated with the mature coat 
proteins. The assembly mechanisms that control transferral 
of these proteins from the membrane to the particle is not 
presently known. 

Both cpm and cpVTH proteins include two domains that 
provide signals for assembly of the mature phage particle. 
The first domain is a secretion signal that directs the newly 
synthesized protein to the host cell membrane. The secretion 
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signal is located at the amino terminus of the protein and 
targets the protein at least to the cell membrane. The second 
domain is a membrane anchor domain that provides signals 
for association with the host cell membrane and for asso- 

5 ciation with the phage particle during assembly. This second 
signal for both cpVTH and cpffl comprises at least a hydro- 
phobic region for spanning the membrane. 

cpvm has been extensively studied as a model membrane 
protein because it can integrate into lipid bilayers such as the 

10 cell membrane in an asymmetric orientation with the acidic 
amino terminus toward the outside and the basic carboxy 
terminus toward the inside of the membrane. The mature 
protein is about 50 amino acid residues in length of which 
11 residues provide the carboxy terminus. 19 residues pro- 

15 vide the hydrophobic transmembrane region, and the 
remaining residues comprise the amino terminus. Consid- 
erable research has been done on the secretion signal region 
of cpvm to advance the study of membrane protein syn- 
thesis and targeting to membranes. However, tittle is known 

20 about the changes that are tolerated in the structure of the 
cpvm membrane anchor region that would allow for assem- 
bly of phage particles. 

Manipulation of the sequence of cpm shows that the 
C-terminal 23 amino acid residue stretch of hydrophobic 

25 amino acids normally responsible for a membrane anchor 
function can be altered in a variety of ways and retain the 
capacity to associate with membranes. However, those 
anchor-modified cpm proteins lost their ability to geneti- 
cally complement gene HI mutants indicating that the 

30 requirements of a membrane anchor for functional assembly 
have not been elucidated. 

Ff phage -based expression vectors have been described in 
which the entire cpm amino acid residue sequence was 
modified by insertion of short polypeptide "epitopes'* 
[Parmely et aL, Gene, 73:305-318 (1988); and Cwiria et al.. 
Prvc. Natl. Acad. Set. USA 87:6378-6382 (1990)] or an 
amino acid residue sequence defining a single chain anti- 
body domain. McCafferty et al.. Science, 348:552-554 

^ (1990). These hybrid proteins were synthesized and 
assembled onto phage particles in amounts of about 5 copies 
per particle, a density at which normal cpTJI is usually found. 
However, these expressed fusion proteins include the entire 
cpm amino acid residue sequence and do not suggest fusion 

45 proteins that utilize only the carboxy terminal membrane 
anchor domain of cpm. 

In addition, no expression system has been described in 
which a phage coat protein has been engineered to allow 
assembly of a heteromeric molecule that is functional and 

50 capable of incorporation into the coat of a phage particle. 

BRIEF SUMMARY OF THE INVENTION 

A new. high density, surface-integration technology has 
been discovered for expressing a recombinant gene product 

55 on the surface of a filamentous phage containing the recom- 
binant gene. The invention uses a filamentous phage cpvm 
membrane anchor domain as a means for linking gene- 
product and gene during the assembly stage of filamentous 
phage replication. 

60 That is, during filamentous phage replication, gene vm- 
encoded proteins assemble into a matrix which encapsulates 
the phage genome. It has now been discovered that (1) phage 
assembly is not disrupted when recombinant gene Vffl- 
encoded proteins are present, (2) recombinant gene VHI- 

65 encoded proteins can be integrated into the assembling 
matrix, and (3) integration into the matrix can be directed to 
occur in a surface-accessible orientation. 
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The present invention can be advantageously applied to FIG. 3 illustrates the sequence of the double-stranded 

the production of heteroraeric receptors of predetermined synthetic DNA inserted into Lambda Zap to produce a 

specificity, i.e.. it can be used to produce antibodies. T-cell Lambda Hc2 expression vector. The preparation of the 

receptors and the like that bind a preselected ligand, double-stranded synthetic DNA insert is described in 

Thus, the present invention provides for linking the func- 5 Example la(ii). The various features required for dais vector 

tions of heteromeric receptor recognition and filamentous to express the V^-coding DNAhomologs include the Shine- 

phage replication in a method for isolating a heteromeric Dalgarno ribosome binding site, a leader sequence to direct 

receptor. The method produces a filamentous phage com- the expressed protein to the periplasm as described by 

prised of a matrix of gene Vm-encoded proteins that encap- Mouva et al., J. BioL Chenu 255:27. 1980, and various 

sulate a recombinant genome. The recombinant genome 10 restriction enzyme sites used to operatively link the V H 

contains genes encoding the proteins of the heteromeric homologs to the expression vector. The V H expression 

receptor proteins. The heteromeric receptor is surface- vector sequence also contains a short nucleic acid sequence 

integrated into the encapsulating matrix via a gene VIII- that codes for amino acids typically found in variable 

encoded membrane anchor domain that is fused by a peptide regions heavy chain (V„ Backbone). This V„ Backbone is 

bond during translation to one of the heteroraeric receptor 15 just upstream and in the proper reading as the V w DNA 

proteins. The heteromeric receptor and the genes which homologs that are operatively linked into the Xho I and Spe 

encode it are physically linked during the assembly stage of I cloning sites. The sequences of the top and bottom strands 

the phage replication cycle. Specifically binding the of the double-stranded synthetic DNA insert are listed 

receptor-coated phage to a solid-support advantageously respectively as SEQ. ID. NO. 1 and SEQ. ID. #0. 2. The 

provides a means for isolating a recombinant genome that ^ synthetic DNA insert is directionaily ligated into Lambda 

encodes a desired heteromeric receptor from a diverse Zap n digested with the restriction enzymes Not 1 and Xho 

library of recombinant genomes. I to form Lambda Hc2 expression vector. 

In one embodiment the present invention contemplates FIG. 4 illustrates the major features of the bacterial 

an antibody molecule comprising heavy- and light-chain expression vector Lambda Hc2 (V„ expression vector). The 

proteins, said heavy-chain protein comprising a V^rdomain w synthetic DNA sequence from FIG. 3 is shown at the top 

flanked by an ammo-terminal prokaryotic secretion signal along with the T 3 polymerase promoter from Lambda Zap IL 

domain and a carboxy-terminal filamentous phage cpVTH The orientation of the insert in Lambda Zap n is shown. The 

membrane anchor domain, said light chain protein compris- V„ DNA homologs are inserted into the Xho I and Spe I 

ing a V^-domain fused to an amino-terminal prokaryotic cloning sites. The read through transcription produces the 

secretion signal domain. 30 decapeptide epitope (tag) that is located just 3* of the cloning 

In another embodiment, the present invention contem- site- 
plates a vector for expressing a fusion protein, said vector FIG. 5 illustrates the sequence of the double-stranded 
comprising a cassette that includes upstream and down- synthetic DNA inserted into Lambda Zap to produce a 
stream translatable DNA sequences operatively linked via a Lambda Lc2 expression vector. The various features 
sequence of nucleotides adapted for directional ligation of 35 required for this vector to express the V^-coding DNA 
an insert DNA. said upstream sequence encoding a prokary- homologs are described in FIG. 3. The V L -coding DNA 
otic secretion signal, said downstream sequence encoding a homologs are operatively linked into the Lc2 sequence at the 
filamentous phage gene cpVUI membrane anchor, said Sac I and Xho I restriction sites. The sequences of the top 
translatable DNA sequences operatively linked to a set of and bottom strands of the double-stranded synthetic DNA 
DNA expression signals for expression of said translatable 40 insert are listed respectively as SEQ. ID. NO. 3 and SEQ. ID. 
DNA sequences as portions of said fusion protein. NO. 4. The synthetic DNA insert is directionaily ligated into 

Lambda Zap II digested with the restriction enzymes Sac I 

BRIEF DESCRIPTION OF THE DRAWINGS ^ No( J£ j^l^m, Lc2 expression vector. 

In the drawings forming a portion of this disclosure: ^Iq $ illustrates the major features of the bacterial 

FIG. 1 illustrates a schematic diagram of the immunoglo- 45 expression vector Lc2 (V L expression vector). The synthetic 

bulin molecule showing the principal structural features. DNA sequence from FIG. 5 is shown at the top along with 

The circled area on the heavy chain represents the variable the T 3 polymerase promoter from Lambda Zap n. The 

region (V w ), a polypeptide containing a biologically active orientation of the insert in Lambda Zap H is shown. The 

(ligand binding) portion of that region, and a gene coding for DNA homologs are inserted into the Sac I and Xho I cloning 

that polypeptide, are produced by the methods of the present ^ sites. 

invention. FIG. 7 illustrates the dicistronic expression vector. 

FIG. 2Ais a diagrammatic sketch of a heavy (H) chain of pCorab. in the form of a phagemid expression vector. To 

human IgG (IgGl subclass). Numbering is from the produce pComb. phagemids were first excised from the 

N-terminus on the left to the C-terminus on the right Note expression vectors. Lambda Hc2 and Lambda Lc2. using an 

the presence of four domains, each containing an intrachain 55 in vivo excision protocol according to manufacturers 

disulfide bond (S — S) spanning approximately 60 amino instructions (Stratagene. La Jolla, Calif.). The pComb 

acid residues. The symbol CHO stands for carbohydrate. expression vector is prepared from Lambda Hc2 and 

The V region of the heavy (H) chain (V w ) resembles V L in Lambda Lc2 which do not contain V^-coding or V^-coding 

having three hypervariable CDR (not shown). DNA homologs. The in vivo excision protocol moved the 

FIG. 2B-1 is a diagraxnmatic sketch of a human light 60 cloned insert from the Lambda Hc2 and Lc2 vectors into a 

(Kappa) chain (Panel 1). Numbering is from the N-teirninus phagemid vector. The resultant phagemids contained the 

on the left to the C-terminus on the right. Note the intrachain same nucleotide sequences for antibody fragment cloning 

disulfide bond (S — S) spanning about the same number of and expression as did the parent vectors. Hc2 and Lc2 

amino acid residues in the V L and C L domains. phagemid expression vectors were separately restriction 

FIG. 2B-2 shows the locations of the complementarity- 65 digested with Sea I and EcoR L The linearized phagemids 

determining regions (CDR) in the domain. Segments were ligated via the Sea I and EcoR I cohesive tennini to 

outside the CDR are the framework segments (FR). form the dicistronic (combinatorial) vector. pComb.- 
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FIG. 8 illustrates a schematic diagram of the composition 
of pCBAK8-2b phagemid vector, the pathway for Fab 
assembly and incorporation in phage coat The vector carries 
the chloramphenicol acetyl transferase (CAT) marker gene 
in addition to the nucleotide residue sequences encoding the 
Fd-cpVm fusion protein and the kappa chain. The f 1 phage 
origin of replication facilitates the generation of single 
stranded phagemid. The isopropyl thiogalactopyranoside 
(TJPTG) induced expression of a dicistronic message encod- 
ing the Fd-cpVm fusion (V H . C m . cpVTJI) and the light 
chain (V^, leads to the formation of heavy and light 
chains. Each chain is delivered to the periplasmic space by 
the pelB target sequence, which is subsequently cleaved. 
The heavy chain is anchored in the membrane by cpVTH 
fusion while the light chain is secreted into the periplasm. 
The heavy chain in the presence of light chain assembles to 
form Fab molecules. The Fabs are incorporated into phage 
particles via cpVm (black dots). 

FIG. 9 illustrates the electron micrographic localization of 
5-7 am colloidal gold particles coated with NPN-BSA 
conjugate along the surface of filamentous phage, and from 
phage emerging from a bacterial cell. Panel 9A shows 
filamentous phage emerging from the surface of the bacterial 
cell specifically labelled with the colloidal gold particles 
coated with BSA-NPN antigen. Panel 9B shows a portion of 
a mature filamentous phage on the length of which is 
exhibited the labelling of antigen binding sites. 

FIG. 10 illustrates the results of a two-site ELISA for 
assaying for the presence and function of Fab antibody 
attached to the surface of bacteriophage particles as 
described in Example 4b. For expression of Fab antibody on 
phage surfaces. XLl-Blue cells were transformed with the 
phagemid expression vector, pCBAK8-2b. The inducer, 
isopropyl thiogalactopyranoside (TPTG). was admixed with 
the bacterial suspension at a final concentration of 1 mM for 
one hour. Helper phage was then admixed with the bacterial 
suspension to initiate the generation of copies of the sense 
strand of the phagemid DNA. After a two hour maintenance 
period, bacterial supematants containing bacteriophage par- 
ticles were collected for assaying in ELISA. 

Specific titratable binding of NFN-Fab-expressing bacte- 
riophage particles to NFN-coated plates was exhibited. No 
binding was detected with helper phage alone. 

FIG. 11 illustrates the inhibition of NPN-Fab expressing 
bacteriophage to NFN antigen-coated plates with the addi- 
tion of increasing amounts of free hapten. The assays were 
performed as described in FIG. 10. Complete inhibition of 
binding was observed with 5 ng of added free NPN hapten. 

FIG. 12 illustrates schematically the process of 
mutagenizing the CDR3 region of a heavy chain fragment 
resulting in an alteration of binding specificity. The oligo- 
nucleotide primers are indicated by black bars. The process 
is described in Example 6. 

DETAILED DESCRIPTION OF THE 
INVENTION 

A. Definitions 

Amino Acid Residue: An amino acid formed upon chemi- 
cal digestion (hydrolysis) of a polypeptide at its peptide 
linkages. The amino acid residues described herein are 
preferably in the 4 *L W isomeric form. However, residues in 
the "D M isomeric form can be substituted for any I^amino 
acid residue, as long as the desired functional property is 
retained by the polypeptide. NH 2 refers to the free amino 
group present at the amino terminus of a polypeptide. 
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COOH refers to the free carboxy group present at the 
carboxy terminus of a polypeptide. In keeping with standard 
polypeptide nomenclature (described in J. Biol. Chem^ 
2433552-59 (1969) and adopted at 37 C.F.R. 1.822(b)(2)). 
5 abbreviations for amino acid residues are shown in the 
following Table of Correspondence: 

TABLE OF CORRESPONDENCE 



SYMBOL 



20 



25 



1 -Letter 


3-Lcrtcr 


AMINO ACID 


Y 


TVr 


tyrosine 


G 


Gly 


glycine 


F 


Fbe 


phenylalanine 


M 


Met 


methionine 


A 


Ala 


alanine 


S 


Ser 


serine 


I 


He 


isoleucine 


L 


Leu 


leucine 


T 


Thr 




V 


Val 


y aline 


P 


Pro 


proline 


K 


Lys 


lysine 


H 


His 


histkfine 


Q 


Gin 


ghitsuninc 


E 


Glu 


glutamic acid 


2 


Gtx 


Ghi and/or Gin 


W 


Tip 


tryptophan 


R 


Arg 


argm^oe 


D 


Asp 


aspartic acid 


N 


AfiQ 


asparagjne 


B 


Asx 


Asn aocVor Asp 


C 


Cys 


cysteine 


f 


Xaa 


Unknown or other 



It should be noted that ail amino acid residue sequences 
represented herein by formulae have a left-to-right orienta- 

35 tion in the conventional direction of amino terminus to 
carboxy terminus. In addition, the phrase "amino acid resi- 
due" is broadly defined to include the amino acids listed in 
the Table of Correspondence and modified and unusual 
amino acids, such as those listed in 37 C.F.R. 1.822(b)(4), 

^ and incorporated herein by reference. Furthermore, it should 
be noted mat a dash at the beginning or end of an amino acid 
residue sequence indicates a peptide bond to a further 
sequence of one or more amino acid residues or a covalent 
bond to an amines terminal group such as NH 2 or acetyl or 

45 to a carboxy-terminal group such as COOH. 

Nucleotide: A monomelic unit of DNA or RNA consisting 
of a sugar moiety (pentose), a phosphate, and a nitrogenous 
heterocyclic base. The base is linked to the sugar moiety via 
the giycosidic carbon (1' carbon of the pentose) and that 

5Q combination of base and sugar is a nucleoside. When the 
nucleoside contains a phosphate group bonded to the 3' or 5' 
position of the pentose it is referred to as a nucleotide. A 
sequence of operatively linked nucleotides is typically 
referred to herein as a **base sequence" or "nucleotide 

55 sequence", and their grammatical equivalents, and is repre- 
sented herein by a formula whose left to right orientation is 
in the conventional direction of 5-terminus to 3'-terminus. 

Base Pair (bp): A partnership of adenine (A) with thymine 
(T), or of cytosine (C) with guanine (G) in a double stranded 

^0 DNA molecule. In RNA. uracil (U) is substituted for thym- 
ine. 

Nucleic Acid: A polymer of nucleotides, either single or 
double stranded. 

Polynucleotide: a polymer of single or double stranded 
65 nucleotides. As used herein "polynucleotide** and its gram- 
matical equivalents, will include the full range of nucleic 
acids. A polynucleotide will typically refer to a nucleic acid 
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molecule comprised of a linear strand of two or more 
deoxyribonucleoudes and/or ribonucleotides. The exact size 
will depend on many factors, which in turn depends on the 
ultimate conditions of use, as is well known in the art The 
polynucleotides of the present invention include primers, 
probes. RNA/DNA segments, oligonucleotides or "oligos" 
(relatively short polynucleotides), genes, vectors, plasmids, 
and the like. 

Gene: A nucleic acid whose nucleotide sequence codes for 
an RNA or polypeptide. A gene can be either RNA or DNA. 

Duplex DNA: a double-stranded nucleic acid molecule 
comprising two strands of substantially complementary 
polynucleotides held together by one or more hydrogen 
bonds between each of the complementary bases present in 
a base pair of the duplex. Because the nucleotides that form 
a base pair can be either a ribonucleotide base or a deox- 
yribonucleotide base, the phrase "duplex DNA* refers to 
either a DNA-DNA duplex comprising two DNA strands (ds 
DNA). or an RNA-DNA duplex comprising one DNA and 
one RNA strand. 

Complementary Bases: Nucleotides mat normally pair up 
when DNA or RNA adopts a double stranded configuration. 

Complementary Nucleotide Sequence: A sequence of 
nucleotides in a single-stranded molecule of DNA or RNA 
that is sufficiently complementary to that on another single 
strand to specifically hybridize to it with consequent hydro- 
gen bonding. 

Conserved: A nucleotide sequence is conserved with 
respect to a preselected (reference) sequence if it non- 
randomly hybridizes to an exact complement of the prese- 
lected sequence. 

Hybridization: The pairing of substantially complemen- 
tary nucleotide sequences (strands of nucleic acid) to form 
a duplex or heteroduplex by the establishment of hydrogen 
bonds between complementary base pairs. It is a specific, Le. 
non-random, interaction between two complementary poly- 
nucleotides that can be competitively inhibited. 

Nucleotide Analog: A purine or pyrimidine nucleotide 
that differs structurally from A. T. G. C, or U. but is 
sufficiently similar to substitute for the normal nucleotide in 
a nucleic acid molecule. 

DNA Homolog: Is a nucleic acid having a preselected 
conserved nucleotide sequence and a sequence coding for a 
receptor capable of binding a preselected ligand. 

Recombinant DNA (rDNA) molecule: a DNA molecule 
produced by operatively linking two DNA segments. Thus, 
a recombinant DNA molecule is a hybrid DNA molecule 
comprising at least two nucleotide sequences not normally 
found together in nature. rDNA's not having a common 
biological origin. Le„ evolutionarily different, are said to be 
"heterologous". 

Vector: a rDNA molecule capable of autonomous repli- 
cation in a cell and to which a DNA segment, e.g.. gene or 
polynucleotide, can be operatively linked so as to bring 
about replication of the attached segment Vectors capable of 
directing the expression of genes encoding for one or more 
proteins are referred to herein as "expression vectors". 
Particularly important vectors allow cloning of cDNA 
(complementary DNA) from mRNAs produced using 
reverse transcriptase. 

Receptor A receptor is a molecule, such as a protein, 
glycoprotein and the like, that can specifically (non- 
randomly) bind to another molecule- 
Antibody: The term antibody in its various grammatical 
forms is used herein to refer to immunoglobulin molecules 
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and immunologically active portions of immunoglobulin 
molecules. Le.. molecules that contain an antibody combin- 
ing site or paratope. Exemplary antibody molecules are 
intact immunoglobulin molecules, substantially intact 

5 immunoglobulin molecules and portions of an immunoglo- 
bulin molecule, including those portions known in the art as 
Fab. Fab*. F(ab') 2 and F(v). 

Antibody Combining Site: An antibody combining site is 
that structural portion of an antibody molecule comprised of 

1° a heavy and light chain variable and hypervariable regions 
that specifically binds (immunoreacts with) an antigen. The 
term immunoreact in its various forms means specific bind- 
ing between an antigenic determinant-containing molecule 
and a molecule containing an antibody combining site such 

15 as a whole antibody molecule or a portion thereof. 

Monoclonal Antibody: The phrase monoclonal antibody 
in its various grammatical forms refers to a population of 
antibody molecules that contains only one species of anti- 
body combining site capable of immunoreacting with a 

20 particular antigen. A monoclonal antibody thus typically 
displays a single binding affinity for any antigen with which 
it immunoreacts. A monoclonal antibody may therefore 
contain an antibody molecule having a plurality of antibody 
combining sites, each immunospecific for a different 

25 antigen, e.g.. a bispecific monoclonal antibody. 

Fusion Protein: A protein comprised of at least two 
polypeptides and a linking sequence to operatively link the 
two polypeptides into one continuous polypeptide. The two 

30 polypeptides linked in a fusion protein are typically derived 
from two independent sources, and therefore a fusion pro- 
tein comprises two linked polypeptides not normally found 
linked in nature. 

Upstream: In the direction opposite to the direction of 

35 DNA transcription, and therefore going from 5* to 3* on the 
non-coding strand, or 3* to 5* on the mRNA. 

Downstream: Further along a DNA sequence in the direc- 
tion of sequence transcription or read out that is traveling in 
a J- to S'-direction along the non-coding strand of the DNA 

40 or 5'- to 3 -direction along the RNA transcript. 

Cistron: Sequence of nucleotides in a DNA molecule 
coding for an amino acid residue sequence and including 
upstream and downstream DNA expression control ele- 
ments. 

43 Stop Codon: Any of three codons that do not code for an 
amino acid, but instead cause termination of protein syn- 
thesis. They are UAG. UAA and UGA and are also referred 
to as a nonsense or termination codon. 

Leader Polypeptide: A short length of amino acid 
sequence at the amino end of a protein, which carries or 
directs the protein through the inner membrane and so 
ensures its eventual secretion into the periplasmic space and 
perhaps beyond. The leader sequence peptide is commonly 
55 removed before the protein becomes active. 

Reading Frame: Particular sequence of contiguous nucle- 
otide triplets (codons) employed in translation. The reading 
frame depends on the location of the translation initiation 
codon. 

60 B. Filamentous Phage 

The present invention contemplates a filamentous phage 
comprising a matrix of cpVUI proteins encapsulating a 
genome encoding first and second polypeptides. The phage 
65 further comprises a heteromeric receptor comprised of the 
first and second polypeptides surface-integrated into the 
matrix via a cpVDI membrane anchor domain fused to at 
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least one of the first or second polypeptides. Preferably, the 
first and second polypeptides are W H and V L proteins, 
respectively. 

The first and second polypeptides are capable of autog- 
enous assembly into a functional receptor, which is 
expressed on the outer surface in a manner accessible to 
ligand. i.e. they are surface-integrated into the phage. 
Typically, the receptor is comprised of a linking polypeptide 
that contains the cpVTQ membrane anchor domain, such as 
a polypeptide described in Section C. and a non-linking 
polypeptides). 

Because the receptor is linked to the phage in a surface 
accessible manner, the phage can be advantageously used as 
a solid-phase affinity sorbenL In preferred embodiments, the 
phage are linked, preferably removably linked, to a solid 
(aqueous insoluble) matrix such as agarose, cellulose, syn- 
thetic reins, polysaccharides and the like. For example, 
transformants shedding the phage can be applied to and 
retained in a column and maintained under conditions mat 
support shedding of the phage. An aqueous composition 
containing a ligand that binds to the receptor expressed by 
the phage is then passed through the column at a predeter- 
mined rate and under receptor-binding conditions to form a 
solid-phase receptor-ligand complex. The column is then 
washed to remove unbound material, leaving the ligand 
bound to the solid- phase phage. The ligand can then be 
removed and recovered by washing the column with a buffer 
that promotes dissociation of the receptor-ligand complex. 

Alternatively, purified phage can be admixed with a 
aqueous solution containing the ligand to be affinity purified. 
The receptor/ligand binding reaction admixture thus formed 
is maintained for a time period and under binding conditions 
sufficient for a phage-linked receptor-ligand complex to 
form. The phage-bound ligand (ligand-bearing phage) are 
then separated and recovered from the unbound materials, 
such as by centrifugation. electrophoresis, precipitation, and 
the like. 

C DNA Expression Vectors 

A vector of the present invention is a recombinant DNA 
(rDNA) molecule adapted for receiving and expressing 
translatable DNA sequences in the form of a fusion protein 
containing a filamentous phage gene VHI membrane anchor 
domain and a prokaryotic secretion signal domain. The 
vector comprises a cassette that includes upstream and 
downstream translatable DNA sequences operatively linked 
via a sequence of nucleotides adapted for directional liga- 
tion. The upstream translatable sequence encodes the secre- 
tion signal. The downstream translatable sequence encodes 
the filamentous phage membrane anchor. The cassette pref- 
erably includes DNA expression control sequences for 
expressing the fusion protein that is produced when a 
translatable DNA sequence is directionally inserted into the 
cassette via the sequence of nucleotides adapted for direc- 
tional ligation. 

An expression vector is characterized as being capable of 
expressing, in a compatible host, a structural gene product 
such as a fusion protein of the present invention. 

As used herein, the term •Rector** refers to a nucleic acid 
molecule capable of transporting between different genetic 
environments another nucleic acid to which it has been 
operatively linked. Preferred vectors are those capable of 
autonomous replication and expression of structural gene 
products present in the DNA segments to which they are 
operatively linked. 

As used herein with regard to DNA sequences or 
segments, the phrase "operatively linked** means the 
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sequences or segments have been covalently joined into one 
shard of DNA. whether in single or double stranded form. 

The choice of vector to which a cassette of this invention 
is operatively linked depends directly, as is well known in 

5 the art. on the functional properties desired, e.g.. vector 
replication and protein expression, and the host cell to be 
transformed, these being limitations inherent in the art of 
constructing recombinant DNA molecules. 
In preferred embodiments, the vector utilized includes a 

10 prokaryotic replicon Le.. a DNA sequence haying the ability 
to direct autonomous replication and maintenance of the 
recombinant DNA molecule extra chromosomally in a 
prokaryotic host cell, such as a bacterial host cell, trans- 
formed therewith. Such replicons are well known in the art 

15 In addition, those embodiments that include a prokaryotic 
replicon also include a gene whose expression confers a 
selective advantage, such as drug resistance, to a bacterial 
host transformed therewith. Typical bacterial drug resistance 
genes are those that confer resistance to ampicillin or 

20 tetracycline. Vectors typically also contain convenient 
restriction sites for insertion of translatable DNA sequences. 
Exemplary vectors are the plasmids pUC8. pUC9, pBR322. 
and pBR329 available from BioRad Laboratories. 
(Richmond, Calif.) and pPL and pKK223 available from 

25 Pharmacia, (Piscataway. N.J.). 

A sequence of nucleotides adapted for directional ligation, 
i.e.. a polylinker. is a region of the DNA expression vector 
that (1) operatively links for replication and transport the 
upstream and downstream translatable DNA sequences and 

30 (2) provides a site or means for directional ligation of a DNA 
sequence into the vector. Typically, a directional polylinker 
is a sequence of nucleotides that defines two or more 
restriction endonudease recognition sequences, or restric- 
tion sites. Upon restriction cleavage, the two sites yield 

35 cohesive termini to which a translatable DNA sequence can 
be ligated to the DNA expression vector. Preferably, the two 
restriction sites provide, upon restriction cleavage, cohesive 
termini that are non-complementary and thereby permit 
directional insertion of a translatable DNA sequence into the 

40 cassette. In one embodiment, the directional ligation means 
is provided by nucleotides present in the upstream translat- 
able DNA sequence, downstream translatable DNA 
sequence, or both. In another embodiment, the sequence of 
nucleotides adapted for directional ligation comprises a 

45 sequence of nucleotides that defines multiple directional 
cloning means. Where the sequence of nucleotides adapted 
for directional Ligation defines numerous restriction sites, it 
is referred to as a multiple cloning site. 

^ A translatable DNA sequence is a linear series of nucle- 
otides that provide an uninterrupted series of at least 8 
codons that encode a polypeptide in one reading frame. 

An upstream translatable DNA sequence encodes a 
prokaryotic secretion signal. The secretion signal is a leader 

55 peptide domain of protein that targets the protein to the 
periplasmic membrane of gram negative bacteria, 

A preferred secretion signal is apelB secretion signal. The 
predicted amino acid residue sequences of the secretion 
signal domain from two pelB gene product variants from 

60 Erwinia carotova are shown in Table 1 as described by Lei. 
et aL. Nature, 331 :543-546 (1988). A particularly preferred 
pelB secretion signal is also shown in Table 1. 

The leader sequence of the pelB protein has previously 
been used as a secretion signal for fusion proteins. Better et 

65 ai. Science, 240:1041-1043 (1988); Sastry et al.. Proc. NatL 
Acad. ScL USA, 86:5728-5732 (1989); and Mullinax et al.. 
Proc. NatL Acad. Set. USA, 87:8095-8099 (1990). 
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Amino acid residue sequences for other secretion signal 
polypeptide domains fromf. coli useful in this invention are 
also listed in Table 1. Oliver, In Neidhard. F. C. (ed.), 
Escherichia coli and Salmonella Typhimuriiun* American 
Society for Microbiology. Washington. D.C.. 1:56-69 
(1987). 

A translatable DNA sequence encoding the pelB secretion 
signal having the amino acid residue sequence shown in 
SEQ. ID. NO. 5 is a preferred DNA sequence for inclusion 
in a DNA expression vector of 

TABLE 1 



Leader Sequences 



SEQ. 

ID. NO. Type Ammo Acid Residue Sequence 



(5) PelB 1 MetLysTytt^ulxuProThrAlaAlaAlaGlyLeuLeu 

LnJ-ruAlaAlaGlnProAlaMct 

(6) PelB 3 MetLy»Tyil^uUuI^ThrAlaAlaAl8Gl)l*uLeu 

bn^uAiaAbGlnProAlaGlnProAiaMctAla 

(7) PeD3 J MctLy^rl^^IkThrP^oIk^AJaAlaG)yl^uLcu 

LcuAlaPbcSctGhiTyrScTLcuAla 

(8) MaE 4 MctLysIleXysthiGlyAlaAisIleLcuAlaLeuScr 

AlaLeuThrThrMetMetPbeSerAlaSerAlaLeuAla 
Lyslk 

(9) OmpF* MctMctLysArgAmDeLeuAlaVoIIkValProAla 

LcuLcuValAiaGiyThrAlaAsnAlaAlaGlu 
(10) PboA 4 Med-ysGlnSerTlirIkAlaUuAM>ei^uPioLeu 

UiiPheThiProVanM-ysAlaAigThr 
(U) BU 4 MctS^DcGliiHisPbeArxValAlal^iJlcProPhc 

PteAlaAlaPbeCysLcuProVaiPbcAlaHisPro 

(12) LamB 4 MetMetlkTM^iArgLysLeii^ 

WAUAlaGlyV^etSciAleGlnAla^tAlaVal 
Asp 

(13) Lpp* MetLysAlaTTid^ysLcuVa^uGlyAlaVallkUu 

GlySeiThiLeuLeiiAlaGlyCysScr 

(14) cpvm 5 XfctLysLysSeiLeuViII^iiLysAJaSerValAlaVal 

AlaThri^ValProMctLcuSerPbcAla 

(15) qjlir 4 Me tLysLysLeuLcuPhcAJalk ProLeu VaTValPro 

PheTyrSerHisSer 



I pelB used in this invemioa 

2 pdfl fiom Erwinia carvtmora gene 

J pdB from Erwinia carotovora EC 16 gene 

*kader sequences from £ coli 

'leader sequence for cpVTH 

A keder sequence for epm 

this invention. 

A downstream translatable DNA sequence encodes a 
filamentous phage membrane anchor. Preferred membrane 
anchors are obtainable from filamentous phage M13. fl. fd, 
and the like, Referred membrane anchor domains are found 
in the coat proteins encoded by gene HI and gene Vm. Thus, 
a downstream translatable DNA sequence encodes an amino 
acid residue sequence that corresponds, and preferably is 
identical, to the membrane anchor domain of either a fila- 
mentous phage gene HI or gene Vm coat protein. 

The membrane anchor domain of a filamentous phage 
coat protein is the carboxy terminal region .of the coat 
protein and includes a region of hydrophobic amino acid 
residues for spanning a lipid bilayer membrane, and a region 
of charged amino acid residues normally found at the 
cytoplasmic face of the membrane and extending away from 
the membrane. In the phage fl. gene VIE coat protein's 
membrane spanning region comprises residue Trp-26 
through Lys-40. and the cytoplasmic region comprises the 
carboxy- terminal 11 residues from 41 to 52. Ohkawa et al., 
J. Biol Chem., 256:9951-9958 (1981). 

The amino acid residue sequence of a preferred mem- 
brane anchor domain derived from the M13 filamentous 
p&age gene ID coat protein (also designated cpIH) has a 
sequence shown in SE<J. ID. NO. 16 from residue 1 to 
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residue 211. Gene ID coat protein is present on a mature 
filamentous phage at one end of the phage particle with 
typically about 4 to 6 copies of the coat protein. 
The amino acid residue sequence of a preferred mem- 

5 brane anchor domain derived from the M13 filamentous 
phage gene VTU coat protein (also designated cpvm) has a 
sequence shown in SEQ. ID. NO. 17 from residue 1 to 
residue 50. Gene VITJ coat protein is present on a mature 
filamentous phage over the majority of the phage particle 
with typically about 2500 to 3000 copies of the coat protein. 

1 For detailed descriptions of the structure of filamentous 
phage particles, their coat proteins and particle assembly, see 
the reviews by Rached et al.. Microbiol fov, 50:401-427 
(1986); and Model et aL. in "The Bacteriophages: Vol. 2**. R. 
Calendar, ed. Plenum Publishing Co.. pp. 375-456. (1988). 

15 A cassette in a DNA expression vector of this invention is 
the region of the vector that forms, upon insertion of a 
translatable DNA sequence, a sequence of nucleotides 
capable of expressing, in an appropriate host a fusion 
protein of this invention. The express ion- competent 

20 sequence of nucleotides is referred to as a cistron. Thus, the 
cassette comprises DNA expression control elements opera- 
tively linked to the upstream and downstream translatable 
DNA sequences. A cistron is formed when a translatable 
DNA sequence is directionally inserted (directionally 

25 ligated) between the upstream and downstream sequences 
via the sequence of nucleotides adapted for that purpose. 
The resulting three translatable DNA sequences, namely the 
upstream, the inserted and the downstream sequences, are all 
operatively linked in the same reading frame. 

DNA expression control sequences comprise a set of 

30 DNA expression signals for expressing a structural gene 
product and include both 5' and 3* elements, as is well 
known, operatively linked to the cistron such that the cistron 
is able to express a structural gene product. The 5' control 
sequences define a promoter for initiating transcription and 

35 a ribosome binding site operatively linked at the 5 1 terminus 
of the upstream translatable DNA sequence. 

To achieve high levels of gene expression in E. coli. it is 
necessary to use not only strong promoters to generate large 
quantities of mRNA. but also ribosome binding sites to 

40 ensure that the mRNA is efficiently translated. In E. coli. the 
ribosome binding site includes an initiation codon (AUG) 
and a sequence 3-9 nucleotides long located 3-11 nucle- 
otides upstream from the initiation codon [Shine et al.. 
Nature, 254 04 (1975)] The sequence. AGGAGGU. which is 

45 called the Shine-Dalgarno (SD) sequence, is complementary 
to the 3' end of £. coli 16S mRNA. Binding of the ribosome 
to mRNA and the sequence at the 3' end of the mRNA can 
be affected by several factors: 

(i) The degree of complementarity between the SD 
50 sequence and 3' end of the 16S tRNA. 

(ii) The spacing and possibly the DNA sequence lying 
between the SD sequence and the AUG [Roberts et al., 
Proc. Natl Acad. ScL USA, 76:760 (1979a); Roberts et 
al., Proc. NatL Acad ScL USA, 76:5596 (1979b); 

55 Guarente et al.. Science, 209:1428 (1980); and 
Guarente et aL, Cell, 20:543 (1980).] Optimization is 
achieved by measuring the level of expression of genes 
in plasmids in which this spacing is systematically 
altered. Comparison of different mRNAs shows that 

60 there are statistically preferred sequences from posi- 
tions -20 to +13 (where the A of the AUG is position 
0) [Gold et al.. Anna. Rev. Microbiol, 35:365 (1981)]. 
Leader sequences have been shown to influence trans- 
lation dramatically (Roberts et al.. 1979 a. b supra). 

65 (iii) The nucleotide sequence following the AUG. which 
affects ribosome binding (Taniguchi et al., J. MoL Biol., 
118:533 (1978)]. 
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Useful ribosomc binding sites are shown in Tabic 2 below. replication. Particularly preferred is a filamentous phage 

origin of replication having a sequence shown in SEQ. ID. 
TABLE 2 NO. 117 and described by Short et al.. Nucl. Acids Res^ 

- 16:7583-7600 (1988). Preferred DNA expression vectors 

Ritosome B Wviif^ sites* 5 ^ e tne djcistronic expression vectors pCOMB8 and 

pCOMB3 described in Example lb(i) and lb(ii). respec- 
tively. 



SEQ. 
ID. NO. 



i. (18) 5* AAUCUU GGAGG CUUTjiJU UAyGG UUCGiJiJCU D. Polypeptides 

1 (19) 5' UAAC UAAGGA UGAAAUOCgg^CUAAGACA 

3. (20) 5' uccuaggagguuugaccuaugcgagcuuuu 10 in another embodiment, the present mvenuon contera- 

4. (2i) 5* augu AC uaagg AtxnJUGUAUGO aac aacgc plates a polypeptide comprisijig an insert domain flanked by 
"~ r . . ! T • • r . ^ mWi an ammo-terminal secretion signal domain and a carboxy- 

•Scqucnces of initiation regions for protein synthesis in tour phage mKNA ° , . * 

TOtec^are im^Uoed. terminal filamentous phage coat protein membrane anchor 

AUG = initiation codon (double underlined) domain. 

z = Qfl 1 ^^ V ™* m ' 15 drably, the polypeptide is a fusion protein having a 

3. = Phage Ri7 gene-A protein receptor domain comprised of an amino acid residue 

4. = Phage lambda gene-cro protein sequence that defines the ligand binding domain of a recep- 

^ ^, . . . , . . *,™;„^v„, tor protein positioned between a prokaryotic secretion signal 

The 3 control sequences define at least one termination , F . y , /7 vtttx ™^Kr~«- ,r,^™- 

(stop) codon in frarne with and operatively linked to the M omainandagcncM^ 

downstream translatable DNA sequence. domaon . In preferred 

Thus, a DNA expression vector of this invention provides polypeptide chain of a heterodimenc receptor Insofar as the 

a system for cloning translatable DNA sequences into the polypeptide has a receptor domain, it is also referred to 

cassette portion of the vector to produce a cistron capable of herein as a receptor. In other preferred embodiments the 

expressing a fusion protein of this invention. secretion signal domain is a pelB secretion signal as 

In preferred emtjpdiments. a DNA expression vector described herein, 

provides a system for independently cloning two translatable Preferred heterodimeric receptors include 

DNA sequences into two separate cassettes present in the immunoglobulins, major histocompatibility antigens of 

vector, to form two separate cistrons for expressing both C ^ 2LSS j or jx lymphocyte receptors, integrins and the like 

polypeptides of a heterodimenc receptor, or the ligand heterodimeric receptors. 

^ 7! : * A ^ „5\ Ai^r+r^ir. ^nr^c amino acid residue sequence that can be represented by the 

expressing two cistrons is referred to as a dtcistromc expres- ~^ t V . /? ^ , ? _ 

sion vector formula, shown in the direction of amino- to carboxy 

Thus . a preferred DNA expression vector of this invention terminus: 

comprises. Ln addition to the cassette previously described in 35 ^ _<win _v— <x\,— Z—COOH, (Fi) 

detail, a second cassette for expressing a second fusion 2 

protein. The second cassette includes a third translatable where O represents an amino acid residue sequence defining 

DNA sequence that encodes a secretion signal, as defined a secretion signal, U represents a first spacer polypeptide. V 

herein before, operatively linked at its 3* terminus via a represents an amino acid residue sequence defining a recep- 

sequence of nucleotides adapted for directional ligation to a ^ tor fornain, X represents a second spacer polypeptide, and Z 

downstream DNA sequence defining at least one stop codon. represents an amino acid residue sequence defining a fila- 

The third translatable DNA sequence is operatively linked at mentous phage ^ prot ein membrane anchor, with the 

its 5' terminus to DNA expression control sequences forming ^ m {$ me ifl Q m { such ^ wnC[1 m is Q 

the y elements defined above ~ d » U is not present, and when m is 1. U is present, and n is 0 

capable, upon insertion of a translatable DNA sequence, of 45 ^^at when n is O. X is not present and when n is 

expressing the second fusion protein. . 

In a preferred embodiment, a DNA expression vector is u * *f P* csent - 

designed^ convenient rnanipulation in the form of a * *e formula (Fl) the secretion signal and the filamen- 
mamcntous phage particle according to the teachings of the tons phage coat protein membrane anchor are as defined 
present invention. Itathis embodiment, a DNA expression herein above. Particularly preferred is a polypeptide accord- 
vector further contains a nucleotide sequence that defines a » ing to formula (Fl) where Z defines the gene VIE membrane 
filamentous phage origin of replication such that the vector, anchor as described herein. In another preferred embodi- 
upon presentation of the appropriate genetic ment the secretion signal is the pelB secretion signal, 
complementation, can replicate as a filamentous phage in In one embodiment. V is an amino acid residue sequence 
single stranded replicative form and be packaged into fila- that defines the ligand binding domain of a chain of a 
mentous phage particles. This feature provides the ability of 55 heterodimeric receptor molecule, and preferably is an immu- 
the DNA expression vector to be packaged into phage noglobulin variable region polypeptide. In a particularly 
particles for subsequent segregation of the particle, and preferred polypeptide V is a V w or polypeptide. Most 
vector contained therein, away from other particles that preferred is a polypeptide where Vis an irnrnunoglobulin V„ 
comprise a population of phage particles. polypeptide, and ra and n arc both zero. 

A filamentous phage origin of replication is a region of the 60 In another embodiment. U or X can define a proteolytic 

phage genome, as is well known, that defines sites for cleavage site, such as the sequence of amino acids found in 

initiation of replication, termination of replication and pack- a precursor protein, such as prothrombin, factor X and the 

aging of the replicative form produced by replication. See. like, that defines the site of cleavage of the protein. A fusion 

for example, Rasched et al., Microbiol. /?en, 50:401-427 protein having a cleavage site provides a means to purify the 

(1986); and Horiuchi. /. MoL BioL 188215-223 (1986). 65 protein away from the phage particle to which it is attached. 

A preferred filamentous phage origin of replication for use The polypeptide spacers U and X can each have any 

in the present invention is a M 13. f 1 or fd phage origin of sequence of arnino acid residues of from about 1 to 6 amino 
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acid residues in length. Typically the spacer residues are amino acid residues, more usually fewer than about 120 

present in a polypeptide to accommodate the continuous amino acid residues, while normally having greater than 60 

reading frame that is required when a polypeptide is pro- amino acid residues, usually greater than about 95 amino 

rrvcr;stvSr h ^ usbgaDNAexprcs * , s^si^c^^k 

.on tA a *«Jc as ^^2^S^S^^ a ^ 

ability to be competitively inhibited, a preselected or pre- f c £ cd 

determined ligand such as an antigen, enzymatic substrate c ™ e add sequcnccs wiu vary widcly . 

and the like. In one embodiment, a receptor of this invention 10 d din ±c particular id i otypc involved. Usually, 

is a ligand binding polypeptide that forms an antigen binding ^ will ^ ^ least cyst eines separated by from about 

site which specifically binds to a deselected antigen to form ^ tQ ?5 amino ^ and joined by a disulfide 

a complex having a sufficiently strong binding between the yhe ^,^^5 pro duced by the subject invention will 

antigen and the binding site for the complex to be isolated. nonnaJ] ^ substanti al copies of idiotypes of the variable 

When the receptor is an antigen binding poly^ptide its ioQS of the heavy and/or lighl cha ins of 

affinity or avidity is generally greater than IF M mow irnmunog i 0 bulins. but in some situations a polypeptide may 

usually greater than 10 6 and preferably greater than 10° M . rafldom mutations in acid residue sequences 

In another embodiment, a receptor of the subject inven- ^ Qida %o advanlagcous i y imp-ove the desired activity, 

tion binds a substrate and catalyzes the formation of a ^ some situations ft {s 6eskaac t0 provide for covalent 

product from the substrate. While the topology of the ligand 20 ^ ^ v ^ ^ y ^ volypcpdAcim which caD be 

binding site of a catalytic receptor is probably more impor- accomplished by prov iding cysteine resides at the carboxyl 

tant for its preselected activity than its affinity (association tamM Thc ^ lypcptide will normally be prepared free of 

constant or pKa) for the substrate, the subject catalytic ^ i^^^^ aslant regions, however a small 

receptors have an association constant for the preselected of ^ } ^ ^ indudcd as a rcsuJt ^ ^ e 

substrate generally greater than 10^ M . more usually 25 advantagcous 5^^011 of DNA synthesis primers. The D 

greater than 10 5 M~ or 10 6 M and preferably greater than ^ normally ^ k thc ^^pt of the V„. 

10 M~ . Typically the C terminus region of the V w and V t 

Preferably the receptor produced by thc subject invention pol ^L )tidcs wfll havc a &ca tcr variety of sequences than 

is heterodimeric and is therefore normally ^comprised of two ^ N ^ ^scd on the present strategy, can be 

different polypeptide chains, which together assume a con- 30 modificd t0 permit a variation of the normally 

formation having a binding affinity, or association constant occurring v ^ v , chains. A synthetic polynucleotide can 

for the preselected ligand that is different preferably higher. bc employed to vary 0 ne or more amino acid in a hyper- 

than the affinity or association constant of either of the var iable region 

polypeptides alone, i.e.. as monomers One or both of die ^ embodimcnts ±c inV ention contemplates a 

different polypeptide chains is derived from the variable 35 hetcrodimeric rec eptor molecule comprised of two polypep- 

region of the light and heavy chains of animmunoglobuhn. ^ ^ ^ qqc Qf which b a ^y^^tc of this 

Typically, polypeptides comprising the light (V J and heavy invention . preferably, the polypeptide comprises a receptor 

(V„) variable regions are employed together for binding the dcdycd from an i^unogiobujin variable chain, 

preselected ligand. more preferably a V„. More preferred is a heterodimeric 

A receptor produced by the subject invention can be 40 rece ^ r coding r £ceptor domains from both V„ and 

active in monomeric as well as multimenc forms, either cnj tf ns 
homomeric or heteromeric. preferably heterodimeric. For 

example. V H and Ugand binding polypeptide produced E. Methods for Producing a library 

by the present invention can be advantageously combined in 1. General Rationale 

the hetercKiimer to modulate the activity of either or to 45 In one embodiment the present invention provides a 

produce an activity unique to the heterodimer. system for the simultaneous cloning and screening of pre- 

The individual ligand polypeptides will be referred to as selected ligand-binding specificities from gene repertoires 

V w and V L and the heterodimer will be referred to as a Fv. using a single vector system. This system provides linkage 

However, it should be understood that a V H may contain in of cloning and screening methodologies and has two 

addition to the V w , substantially all or a portion of the heavy 50 requirements. First, that expression of the polypeptide 

chain constant region. Similarly, a V A may contain, in chains of a heterodimeric receptor in an in vitro expression 

addition 10 the V^. substantially all or a portion of the light host such as E. colt requires coexpression of the two 

chain constant region. A heterodimer comprised of a W H polypeptide chains in order that a functional heterodimeric 

containing a portion of the heavy chain constant region and receptor can .assemble to produce a receptor that binds 

a containing substantially all of the light chain constant 55 ligand. Second, that screening of isolated members of the 

region is termed a Fab fragment. The production of Fab can library for a preselected ligand-binding capacity requires a 

be advantageous in some situations because the additional means to correlate the binding capacity of an expressed 

constant region sequences contained in a Fab as compared to receptor molecule with a convenient means to isolate the 

a Fv can stabilize the and V L interaction. Such stabili- gene that encodes the member from thc library, 

zation can cause the Fab to have higher affinity for antigen. 60 Linkage of expression and screening is accomplished by 

In addition the Fab is more commonly used in the art and the combination of targeting of a fusion protein into the 

thus there are more commercial antibodies available to periplasm of a bacterial cell to allow assembly of a func- 

specifically recognize a Fab in screening procedures. tional receptor, and the targeting of a fusion protein onto the 

The individual V„ and V L polypeptides can bc produced coat of a filamentous phage particle during phage assembly 

in lengths equal to or substantially equal to their naturally 65 to allow for convenient screening of the library member of 

occurring lengths. However, in preferred embodiments, the interest. Periplasmic targeting is provided by the presence of 

V H and W L polypeptides will generally have fewer than 125 a secretion signal domain in a fusion protein of this inven- 
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tion. Targeting to a phage particle is provided by the 
presence of a filamentous phage coat protein membrane 
anchor domain in a fusion protein of this invention. 

The present invention describes in one embodiment a 
method for producing a library of DNA molecules, each 
DNA molecule comprising a cistron for expressing a fusion 
protein on the surface of a filamentous phage particle. The 
method comprises the steps of (a) forming a ligation admix- 
ture by combining in a ligation buffer (i) a repertoire of 
polypeptide encoding genes and (ii) a plurality of DNA 
expression vectors in linear form adapted to form a fusion 
protein expressing cistron. and (b) subjecting the admixture 
to ligation conditions for a time period sufficient for the 
repertoire of genes to become operatively linked (ligated) to 
the plurality of vectors to form the library. 

In this embodiment, the repertoire of polypeptide encod- 
ing genes are in the form of double-stranded (ds) DNA and 
each member of the repertoire has cohesive termini adapted 
for directional ligation. In addition, the plurality of DNA 
expression vectors are each linear DNA molecules having 
upstream and downstream cohesive termini that are (a) 
adapted for directionally receiving the polypeptide genes in 
a common reading frame, and (b) operatively linked to 
respective upstream and downstream translatable DNA 
sequences. The upstream translatable DNA sequence 
encodes a secretion signal, preferably a pelB secretion 
signal, and the downstream translatable DNA sequence 
encodes a filamentous phage coat protein membrane anchor 
as described herein for a polypeptide of this invention. The 
translatable DNA sequences are also operatively linked to 
respective upstream and downstream DNA expression con- 
trol sequences as denned for a DNA expression vector 
described herein. 

The library so produced can be utilized for expression and 
screening of the fusion proteins encoded by the resulting 
library of cistrons represented in the library by the expres- 
sion and screening methods described herein. 
2. Production of Gene Repertoires 

A gene repertoire is a collection of different genes, 
preferably polypep tide-encoding genes (polypeptide genes), 
and may be isolated from natural sources or can be generated 
artificially. Preferred gene repertoires are comprised of con- 
served genes. Particularly preferred gene repertoires com- 
prise either or both genes that code for the members of a 
dimeric receptor molecule. 

A gene repertoire useful in practicing the {resent inven- 
tion contains at least 10 3 . preferably at least 10 4 . more 
preferably at least 10 5 . and most preferably at least 10 7 
different genes. Methods for evaluating the diversity of a 
repertoire of genes is well known to one skilled in the art 

Thus, in one embodiment the present invention contem- 
plates a method of isolating a pair of genes coding for a 
dimeric receptor having a preselected activity from a rep- 
ertoire of conserved genes. Additionally, expressing the 
cloned pair of genes and isolating the resulting expressed 
dimeric receptor protein is also described. Preferably, the 
receptor will be a heterodimeric polypeptide capable of 
binding a ligand, such as an antibody molecule or immu- 
nologically active portion thereof, a cellular receptor, or a 
cellular adhesion protein coded for by one of the members 
of a family of conserved genes, i.e., genes containing a 
conserved nucleotide sequence of at least about 10 nucle- 
otides in length. 

Exemplary conserved gene families encoding different 
polypeptide chains of a dimeric receptor arc those coding for 
immunoglobulins, major histocompatibility complex anti- 
gens of class I or n. lymphocyte receptors, integrins and the 
like. 
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A gene can be identified as belonging to a repertoire of 
conserved genes using several methods. For example, an 
isolated gene may be used as a hybridization probe under 
low stringency conditions to detect other members of the 

5 repertoire of conserved genes present in genomic DNA 
using the methods described by Southern, Mol BioU 
98:503 (1975). If the gene used as a hybridization probe 
hybridizes to multiple restriction endonuclease fragments of 
the genome, that gene is a member of a repertoire of 

10 conserved genes. 
Immunoglobulins 

The immunoglobulins, or antibody molecules, are a large 
family of molecules that include several types of molecules, 
such as IgD. IgG, IgA. IgM and IgE. The antibody molecule 

15 is typically comprised of two heavy (H) and light (L) chains 
with both a variable (V) and constant (C) region present on 
each chain as shown in FIG. 1. Schematic diagrams of 
human IgG heavy chain and human kappa light chain are 
shown in FIGS. 2A and 2B. respectively. Several different 

20 regions of an immunoglobulin contain conserved sequences 
useful for isolating an immunoglobulin repertoire. Extensive 
amino acid and nucleic acid sequence data displaying exem- 
plary conserved sequences is compiled for immunoglobulin 
molecules by Kabat et al.. in Sequences of Proteins of 

25 Immunological Interest. National Institutes of Health. 
Bethesda. MA. 1987. 

The C region of the H chain defines the particular immu- 
noglobulin type. Therefore the selection of conserved 
sequences as defined herein from the C region of the H chain 

30 results in the preparation of a repertoire of immunoglobulin 
genes having members of the immunoglobulin type of the 
selected C region. 

The V region of the H or L chain typically comprises four 
framework (FR) regions each containing relatively lower 

35 degrees of variability that includes lengths of conserved 
sequences. The use of conserved sequences from the FR1 
and FR4 (J region) framework regions of the V H chain is a 
preferred exemplary embodiment and is described herein in 
the Examples. Framework regions are typically conserved 

40 across several or all immunoglobulin types and thus con- 
served sequences contained therein are particularly suited 
for preparing repertoires having several immunoglobulin 
types. 

Major Histocompatibility Complex 

45 The major histocompatibility complex (MHC) is a large 
genetic locus that encodes an extensive family of proteins 
that include several classes of molecules referred to as class 
I. class JJ or class III MHC molecules. Paul et al.. in 
Fundamental Immunology. Raven Press. N.Y., pp. 303-378 

50 (1984). 

Class I MHC molecules are a polymorphic group of 
transplantation antigens representing a conserved family in 
which the antigen is comprised of a heavy chain and a 
non-MHC encoded ligfit chain. The heavy chain includes 

55 several regions, termed the N. CI, C2, membrane and 
cytoplasmic regions. Conserved sequences useful in the 
present invention are found primarily in the N. CI and C2 
regions and are identified as continuous sequences of 
"invariant residues" in Kabat et aL. supra. 

60 Class II MHC molecules comprise a conserved family of 
polymorphic antigens that participate in immune respon- 
siveness and are comprised of an alpha and a beta chain. The 
genes coding for the alpha and beta chain each include 
several regions that contain conserved sequences suitable for 

65 producing MHC class n alpha or beta chain repertoires. 
Exemplary conserved nucleotide sequences include those 
coding for amino acid residues 26-30 of the A 1 region. 
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residues 161-170 of the A2 region and residues 195-206 of been recently challenged results in producing a repertoire 

the membrane region, all of the alpha chain. Conserved that is not biased towards the production of high affinity V w 

sequences are also present in the Bl, B2 and membrane and/or V L polypeptides. 

regions of the beta chain at nucleotide sequences coding for It should be noted the greater the genetic heterogeneity of 

amino acid residues 41^*5. 150-162 and 200-209. respec- 5 the population of cells for which the nucleic acids are 

tiveiy obtained, the greater the diversity of the immunological 

Lymphocyte Receptors and Cell Surface Antigens repertoire (comprising and V^-coding genes) that will 

Lymphocytes contain several families of proteins on their be made available for screening according to the method of 

cell surfaces including the T-cell receptor. Thy- 1 antigen and the present invention. Thus, cells from different individuals, 

numerous T-ceil surface antigens including the antigens 10 particularly those having an immunologically significant age 

defined by the monoclonal antibodies OKT4(leu3). OKT5/8 difference, and cells from individuals of different strains. 

(Ieu2), OKT3. OKT1 (leul), OKT 11 (leu5) OKT6 and races or species can be advantageously combined to increase 
OKT9. Paul, supra at pp. 458^179. . the heterogeneity (diversity) of a repertoire. 

The T-cell receptor is a term used for a family of antigen Thus, in one preferred embodiment, the source cells are 

binding molecules found on the surface of T-cells. The T-cell 15 obtained from a vertebrate, preferably a mammal, which has 

receptor as a family exhibits polymorphic binding specificity been immunized or partially immunized with an antigenic 

similar to immunoglobulins in its diversity. The mature ligand (antigen) against which activity is sought, i.e.. a 

T-cell receptor is comprised of alpha and beta chains each preselected antigen. The immunization can be carried out 

having a variable (V) and constant (C) region. The similari- conventionally. Antibody titer in the animal can be moni- 

ties that the T-cell receptor has to immunoglobulins in 20 tared to determine the stage of immunization desired, which 

genetic organization and function shows that T-cell receptor stage corresponds to the amount of enrichment or biasing of 

contains regions of conserved sequence. Lai et al.. Nature, the repertoire desired Partially immunized animals typically 

33 1*543-546 (1988). receive only one immunization and cells are collected from 

Exemplary conserved sequences include those coding for those animals shortly after a response is detected. Fully 

amino acid residues 84-90 of alpha chain, amino acid 25 immunized animals display a peak titer, which is achieved 

residues 107-115 of beta chain, and amino acid residues with one or more repeated injections of the antigen into the 

91-95 and 111-116 of the gamma chain- Rabat et aL, supra, host mammal, normally at 2 to 3 week intervals. Usually 

p 279 three to five days after the last challenge, the spleen is 

Integriiis And Adhesions removed and the genetic repertoire of the splenocytes. about 

Adhesive proteins involved in ceil attachment are mem- 30 90% of which are rearranged B cells, is isolated using 

bers of a large family of related proteins termed integrins. standard procedures. See. Current Protocols in Molecular 

Integrins are heterodimers comprised of a beta and an alpha Biology. Ausubel et al., eds.. John Wiley & Sons. NY. 

subunit. Members of the integrin family include the cell Nucleic acids coding for V H and V L polypeptides can be 

surface glycoproteins platelet receptor GpHb-IIIa. vitronec- derived from cells producing IgA. IgD. IgE. IgG or IgM, 

tin receptor (VnR). fibronectin receptor (FnR) and the leu- 35 most preferably from IgM and IgG. producing cells, 

kocyte adhesion receptors LFA-1. Mac-1. Mo-1 and 60.3. Methods for preparing fragments of genomic DNA from 

Rouslahti et aL, Science, 238:491-497 (1987). Nucleic acid which immunoglobulin variable region genes can be cloned 

and protein sequence data demonstrates regions of con- as a diverse population axe well known in the art. See for 

served sequences exist in the members of these families. example Herrmann et al.. Methods In EnzymoL, 

particularly between the beta chain of GpHb-IIIa. VnR and 40 152:180-183. (1987); Frischauf, Methods In EnzymoU 

FnR, and between the alpha subunit of VnR. Mac-1, LFA-1. 152:183-190 (1987); Frischauf. Methods In EnzymoL, 

FnR and GpffiVIHa. Suzuki et al- Proc. Natl Acad. Sci. 152:190-199 (1987); and DiLeUa et al.. Methods In 

USA. 83:8614-8618. 1986; Ginsberg et al., J. Biol Chem.. EnzymoL, 152:199-212 (1987). (The teachings of the ref- 

262:5437-5440. 1987. erences cited herein are hereby incorporated by reference.) 

Various well known methods can be employed to produce 45 The desired gene repertoire can be isolated from either 

a useful gene repertoire. For instance. V w and V L gene genomic material containing the gene expressing the vari- 

repertoires can be produced by isolating V^- and V t -coding able region or die messenger RNA (mRNA) which repre- 

mRNA from a heterogeneous population of antibody pro- sents a transcript of the variable region. The difficulty in 

during cells. i.e.. B lymphocytes (B cells), preferably rear- using the genomic DNA from other than non-rearranged B 

ranged B cells such as those found in the circulation or 50 lymphocytes is in juxtaposing the sequences coding for the 

spleen of a vertebrate. Rearranged B cells are those in which variable region, where the sequences are separated by 

inununoglobulin gene translocation, i.e., rearrangement, has introns. The DNA fragment(s) containing the proper exons 
occurred as evidenced by the presence in the cell of mRNA must be isolated, the introns excised, and the exons then 

with the immunoglobulin gene V. D and J region transcripts spliced in the proper order and in the proper orientation. For 

adjacently located thereon. Typically, the B cells are col- 55 the most part this will be difficult, so that the alternative 

lected in a 1-100 ml sample of blood which usually contains technique employing rearranged B cells will be the method 

10* B cells/ml of choice because the V, D and J immunoglobulin gene 

In some cases, it is desirable to bias a repertoire for a regions have translocated to become adjacent, so that the 

preselected activity, such as by using as a source of nucleic sequence is continuous (free of introns) for the entire 
acid cells (source cells) from vertebrates in any one of 60 variable regions. 

various stages of age, health and immune response. For Where mRNA is utilized the cells will be lysed under 
example, repeated immunization of a healthy animal prior to RNase inhibiting conditions. In one embodiment the first 

collecting rearranged B cells results in obtaining a repertoire step is to isolate the total cellular mRNA. Poly A+ mRNA 

enriched for genetic material producing a receptor of high can then be selected by hybridization to an oligo-dT cellu- 

affinity. Mullinax et al.. Proc. Natl. Acad. Set. USA. 65 lose column. The presence of mRNAs coding for the heavy 

87:8095-8099(1990). Conversely, collecting rearranged B and/or light chain polypeptides can then be assayed by 
cells from a healthy animal whose immune system has not hybridization with DNA single strands of the appropriate 
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genes. Conveniently, the sequences coding for the constant 
portion of the V„ and can be used as polynucleotide 
probes, which sequences can be obtained from available 
sources. See for example. Early and Hood. Genetic 
Engineering. Setlow and Hollaender, eds.. Vol. 3, Plenum 
Publishing Corporation. NY. (1981). pages 157-188; and 
Rabat et aL Sequences of Immunological Interest. National 
Institutes of Health, Bethesda. Md, (1987). 

In preferred embodiments, the preparation containing the 
total cellular raRNA is first enriched for the presence of \ H 
and/or V L coding mRNA. Enrichment is typically accom- 
plished by subjecting the total mRNA preparation or par- 
tially purified mRNA product thereof to a primer extension 
reaction employing a polynucleotide synthesis primer as 
described herein. Exemplary methods for producing V„ and 
W L gene repertoires using polynucleotide synthesis primers 
are described in PCT Application No. PCT/US 90/02836 
(International Publication No. WO 90/14430). Particularly 
preferred methods for producing a gene repertoire rely on 
the use of preselected oligonucleotides as primers in a 
polymerase chain reaction (PCR) to form PCR reaction 
products as described herein. 

In preferred embodiments, isolated B ceils are immunized 
in vitro against a preselected antigen. In vitro immunization 
is defined as the clonal expansion of epitope-specific B cells 
in culture, in response to antigen stimulation. The end result 
is to increase the frequency of antigen-specific B cells in the 
immunoglobulin repertoire, and thereby decrease the num- 
ber of clones in an expression library that must be screened 
to identify a clone expressing an antibody of the desired 
specificity. The advantage of in vitro immunization is that 
human monoclonal antibodies can be generated against a 
limitless number of therapeutically valuable antigens, 
including toxic or weak immunogens. For example, anti- 
bodies specific for the polymorphic determinants of tumor- 
associated antigens, rheumatoid factors, and histocompat- 
ibility antigens can be produced, which can not be elicited 
in immunized animals. In addition, it may be possible to 
generate immune responses which are normally suppressed 
in vivo. 

In vitro immunization can be used to give rise to either a 
primary or secondary immune response. A primary immune 
response, resulting from first time exposure of a B cell to an 
antigen, results in clonal expansion of epitope-specific cells 
and the secretion of IgM antibodies with low to moderate 
apparent affinity constants (lO^lO 8 M" 1 ). Primary immu- 
nization of human splenic and tonsillar lymphocytes in 
culture can be used to produce monoclonal antibodies 
against a variety of antigens, including cells, peptides. 
macTomolecule. haptens, and tumor- associated antigens. 
Memory B cells from immunized donors can also be stimu- 
lated in culture to give rise to a secondary immune response 
characterized by clonal expansion and the production of 
high affinity antibodies (>\<f M" 1 ) of the IgG isotype, 
particularly against viral antigens by clonally expanding 
sensitized lymphocytes derived from seropositive individu- 
als. 

In one embodiment peripheral blood lymphocytes are 
depleted of various cytolytic cells that appear to down- 
modulate antigen-specific B cell activation. When 
lysosome-rich subpopulations (natural killer cells, cytotoxic 
and suppressor T cells, monocytes) are first removed by 
treatment with the lysosmotropic methyl ester of leucine, the 
remaining cells (including B cells. T helper cells, accessory 
cells) respond antigen-specifically during in vitro immuni- 
zation. The lymph okinc requirements for inducing antibody 
production in culture are satisfied by a culture supernatant 
from activated, irradiated T cells. 
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In addition to in vitro immunization, cell panning 
(immunoaftlnity absorption) can be used to further increase 
the frequency of antigen-specific B cells. Techniques for 
selecting B cell subpopulations via solid-phase antigen 

5 binding are well established. Panning conditions can be 
optimized to selectively enrich for B cells which bind with 
high aflinity to a variety of antigens, including cell surface 
proteins. Panning can be used alone, or in combination with 
in vitro immunization to increase the frequency of antigen- 
ic specific cells above the levels which can be obtained with 
either technique alone. Immunoglobulin expression libraries 
constructed from enriched populations of B cells are biased 
in favor of antigen- specific antibody clones, and thus, 
enabling identification of clones with the desired specifici- 

15 ties from smaller, less complex libraries. 
3. Preparation of Polynucleotide Primers 

The term "polynucleotide" as used herein in reference to 
primers, probes and nucleic acid fragments or segments to 
be synthesized by primer extension is defined as a molecule 

20 comprised of two or more deoxyribonucleotide or 
ribonucleotides, preferably more than 3. Its exact size will 
depend on many factors, which in turn depends on the 
ultimate conditions of use. 
The term "primer" as used herein refers to a polynucle- 

23 otide whether purified from a nucleic acid restriction digest 
or produced synthetically, which is capable of acting as a 
point of initiation of nucleic acid synthesis when placed 
under conditions in which synthesis of a primer extension 
product which is complementary to a nucleic acid strand is 

30 induced. Le.. in the presence of nucleotides and an agent for 
polymerization such as DNA polymerase, reverse tran- 
scriptase and the like, and at a suitable temperature and pH. 
The primer is preferably single stranded for maximum 
efficiency, but may alternatively be in double stranded form. 

35 If double stranded, the primer is first treated to separate it 
from its complementary strand before being used to prepare 
extension products. Preferably, the primer is a polydeoxyri- 
bonucleotide. The primer must be sufficiently long to prime 
the synthesis of extension products in the presence of the 

40 agents for polymerization. The exact lengths of the primers 
will depend on may factors, including temperature and the 
source of primer. For example, depending on the complexity 
of the target sequence, a polynucleotide primer typically 
contains 15 to 25 ox more nucleotides, although it can 

45 contain fewer nucleotides. Short primer molecules generally 
require cooler temperatures to form sufficiently stable hybrid 
complexes with template. 

The primers used herein are selected to be "substantially" 
complementary to the different strands of each specific 

50 sequence to be synthesized or amplified. This means that the 
primer must be sufficiently complementary to no n -randomly 
hybridize with its respective template strand. Therefore, the 
primer sequence may or may not reflect the exact sequence 
of the template. For example, a non-complementary nucle- 

55 otide fragment can be attached to the 5' end of the primer, 
with the remainder of the primer sequence being substan- 
tially complementary to the strand. Such non- 
complementary fragments typically code for an endonu- 
clease restriction site. Alternatively, non-complementary 

60 bases or longer sequences can be interspersed into the 
- primer, provided the primer sequence has sufficient comple- 
mentarily with the sequence of the strand to be synthesized 
or amplified to non-randomly hybridize therewith and 
thereby form an extension product under polynucleotide 

65 synthesizing conditions. 

Primers of the present invention may also contain a 
DNA-dependent RNA polymerase promoter sequence or its 
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complement. See for example. Krieg et al.. Nuci Acids Res., 
12:7057-70 (1984); Studier et al.. 7. Mol. Biol., 
189:113-130 (1986); and Molecular Cloning: A Laboratory 
Manual, Second Edition. Maniatis et al, eds.. Cold Spaing 
Harbor, N.Y. (1989). 

When a primer containing a DNA-dependent RNA poly- 
merase promoter is used the primer is hybridized to the 
polynucleotide strand to be amplified and the second poly- 
nucleotide strand of the DNA-dependent RNA polymerase 
promoter is completed using an inducing agent such as £ 
coli DNA polymerase L or the Klenow fragment of £. coli 
DNA polymerase. The starting polynucleotide is amplified 
by alternating between the production of an RNA polynucle- 
otide and DNA polynucleotide. 

Primers may also contain a template sequence or repli- 
cation initiation site for a RNA-directed RNA polymerase. 
Typical RNA-directed RNA polymerase include the QB 
replicase described by Lizardi et al.. Biotechnology, 
6:1197-1202 (1988) RNA-directed polymerases produce 
large numbers of RNA strands from a small number of 
template RNA strands that contain a template sequence or 
replication initiation site. These polymerases typically give 
a one million-fold amplification of the template strand as has 
been described by Kramer et al. J. Mol. Biol.. 89:719-736 
(1974). 

The polynucleotide primers can be prepared using any 
suitable method, such as. for example, the phosphotriester or 
phosphodi ester methods see Narang et aL, Metfu EnzymoU 
68:90. (1979); U.S. Pat. No. 4356.270; and Brown et aL. 
Metk EnzymoL 68:109, (1979). 

The choice of a primer's nucleotide sequence depends on 
factors such as the distance on the nucleic acid from the 
region coding for the desired receptor, its hybridization site 
on the nucleic acid relative to any second primer to be used, 
the number of genes in the repertoire it is to hybridize to, and 
the like. 

a. Primers for Producing Immunoglobulin Gene Repertoires 
V H and V L gene repertoires can be separately prepared 
prior to their utilization in the present invention. Repertoire 
preparation is typically accomplished by primer extension, 
preferably by primer extension in a polymerase chain reac- 
tion (PGR) format 

To produce a repertoire of V^rcoding DNA homologs by 
primer extension, the nucleotide sequence of a primer is 
selected to hybridize with a plurality of immunoglobulin 
heavy chain genes at a site substantially adjacent to the 
V^-coding region so that a nucleotide sequence coding for 
a functional (capable of binding) polypeptide is obtained. To 
hybridize to a plurality of different V^coding nucleic acid 
strands, the primer must be a substantial complement of a 
nucleotide sequence conserved among the different strands. 
Such sites include nucleotide sequences in the constant 
region, any of the variable region framework regions, pref- 
erably the third framework region, leader region, promoter 
region. J region and the like. 

If the repertoires of V^coding and V L -coding DNA 
homologs are to be produced by (PGR) amplification, two 
primers, Le., a PCR primer pair, must be used for each 
coding strand of nucleic acid to be amplified. The first 
primer becomes part of the nonsense (minus or 
complementary) strand and hybridizes to a nucleotide 
sequence conserved among V H (plus or coding) strands 
within the repertoire. To produce V M coding DNA 
homologs, first primers are therefore chosen to hybridize to 
(i.e. be complementary to) conserved regions within the J 
region. CHI region, hinge region, CH2 region, or CH3 
region of immunoglobulin genes and the like. To produce a 
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coding DNA homolog. first primers are chosen to hybrid- 
ize with (i.e. be complementary to) a conserved region 
within the J region or constant region of immunoglobulin 
light chain genes and the like. Second primers become part 

5 of the coding (plus) strand and hybridize to a nucleotide 
sequence conserved among minus strands. To produce the 
V^coding DNA homologs. second primers are therefore 
chosen to hybridize with a conserved nucleotide sequence at 
the 5' end of the V^-coding immunoglobulin gene such as in 

io that area coding for the leader or first framework region. It 
should be noted that in the amplification of both V^- and 
Vj-coding DNA homologs the conserved 5' nucleotide 
sequence of the second primer can be complementary to a 
sequence exogenously added using terminal deoxynucleoti- 

15 dyl transferase as described by Loh et al.. Science, 
243:217-220 (1989). One or both of the first and second 
primers can contain a nucleotide sequence defining an 
endonuclease recognition site. The site can be heterologous 
to the immunoglobulin gene being amplified and typically 

20 appears at or near the 5* end of the primer. 

When present, the restriction site-defining portion is typi- 
cally located in a S'-terminal non-priming portion of the 
primer. The restriction site defined by the first primer is 
typically chosen to be one recognized by a restriction 

25 enzyme that does not recognize the restriction site defined 
by the second primer, the objective being to be able to 
produce a DNA molecule having cohesive termini that are 
non-complementary to each other and thus allow directional 
insertion into a vector. 

30 In one embodiment, the present invention utilizes a set of 
polynucleotides that form primers having a priming region 
located at the 3 , -texminus of the primer. The priming region 
is typically the 3 '-most (y-terrninal) 15 to 30 nucleotide 
bases. The 3 , -terminal priming portion of each primer is 

35 capable of acting as a primer to catalyze nucleic acid 
synthesis, i.e., initiate a {rimer extension reaction off its 3' 
terminus. One or both of the primers can additionally 
contain a 5 -terminal (5 -most) non-priming portion, i.e.. a 
region that does not participate in hybridization to repertoire 

40 template. 

In PCR. each primer works in combination with a second 
primer to amplify a target nucleic arid sequence. The choice 
of PCR primer pairs for use in PCR is governed by consid- 
erations as discussed herein for producing gene repertoires. 
45 That is. the primers have a nucleotide sequence that is 
complementary to a sequence conserved in the repertoire. 
Useful V w and V L priming sequences are shown in Tables 5 
and 6, herein below. 

4. Polymerase Chain Reaction to Produce Gene Repertoires 

50 The strategy used for cloning the V w and V t genes 
contained within a repertoire will depend, as is well known 
in the art. on the type, complexity, and purity of the nucleic 
acids making up the repertoire. Other factors include 
whether or not the genes are contained in one or a plurality 

55 of repertoires and whether or not they are to be amplified 
and/or mutagen] zed. 

The and V A -coding gene repertoires are comprised of 
polynucleotide coding strands, such as mRNA and/or the 
sense strand of genomic DNA. If the repertoire is in the form 

60 of double stranded genomic DNA. it is usually first 
denatured, typically by melting, into single strands. A rep- 
ertoire is subjected to a PCR reaction by treating 
(contacting) the repertoire with a PCR primer pair, each 
member of the pair having a preselected nucleotide 

65 sequence. The PCR primer pair is capable of initiating 
primer extension reactions by hybridizing to nucleotide 
sequences, preferably at least about 10 nucleotides in length 
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and more preferably at least about 20 nucleotides in length, 
conserved within the repertoire. The first palmer of a PCR 
primer pair is sometimes referred to herein as the "sense 
primer*' because it hybridizes to the coding or sense strand 
of a nucleic acid. In addition, the second primer of a PCR 
primer pair is sometimes referred to herein as the "anti-sense 
primer" because it hybridizes to a non-coding or anti- sense 
strand of a nucleic acid, i.e., a strand complementary to a 
coding strand. 

The PCR reaction is performed by mixing the PCR primer 
pair, preferably a predetermined amount thereof, with the 
nucleic acids of the repertoire, preferably a predetermined 
amount thereof, in a PCR buffer to form a PCR reaction 
admixture. The admixture is maintained under polynucle- 
otide synthesizing conditions for a time period, which is 
typically predeterrnined. sufficient for the formation of a 
PCR reaction product thereby producing a plurality of 
different V^oding and/or V £ -coding DNA homologs. 

A plurality of first primer and/or a plurality of second 
primers can be used in each amplification, e.g.. one species 
of first primer can be paired with a number of different 
second primers to form several different primer pairs. 
Alternatively, an individual pair of first and second primers 
can be used. In any case, the amplification products of 
amplifications using the same or different combinations of 
first and second primers can be combined to increase the 
diversity of the gene library. 

In another strategy, the object is to clone the V^- and/or 
V^-coding genes from a repertoire by providing a polynucle- 
otide complement of the repertoire, such as the anti-sense 
strand of genomic dsDNA or the polynucleotide produced 
by subjecting mRNA to a reverse transcriptase reaction. 
Methods for producing such complements are well known in 
the art 

The PCR reaction is performed using any suitable 
method. Generally it occurs in a buffered aqueous solution, 
Le., a PCR buffer, preferably at a pH of 7-9. most preferably 
about 8. Preferably, a molar excess (for genomic nucleic 
acid, usually about 10 6 :1 rximer:template) of the primer is 
admixed to the buffer containing the template strand. A large 
molar excess is preferred to improve the efficiency of the 
process. 

The PCR buffer also contains the deoxyribonucleoude 
triphosphates dATP, dCTP. dGTP. and dTTP and a 
polymerase, typically thermostable, all in adequate amounts 
for primer extension (polynucleotide synthesis) reaction. 
The resulting solution (PCR admixture) is heated to about 
90° C. -100° C. for about 1 to 10 minutes, preferably from 
1 to 4 minutes. After this heating period the solution is 
allowed to cool to 54° C which is preferable for primer 
hybridization. The synthesis reaction may occur at from 
room temperature up to a temperature above which the 
polymerase (inducing agent) no longer functions efficiently. 
Thus, for example, if DNA polymerase is used as inducing 
agent the temperature is generally no greater than about 40° 
C. An exemplary PCR buffer comprises the following: 50 
mM KC1; 10 mM Tris-HG; pH 8.3; 1.5 mM Mgcl 2 ; 0.001% 
(wt/vol) gelatin, 200 uM dATP; 200 uM dTTP; 200 uM 
dCTP; 200 uM dGTP; and 2.5 units Thermus aquaticus 
DNA polymerase I (U.S. Pat. No. 4,889,818) per 100 
microliters of buffer. 

The inducing agent may be any compound or system 
which will function to accomplish the synthesis of primer 
extension products, including enzymes. Suitable enzymes 
for this purpose include, for example, E. colt DNA poly- 
merase L Klenow fragment of E. coli DNA polymerase I, T4 
DNA polymerase, other available DNA polymerases. 
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reverse transcriptase, and other enzymes, including heat- 
stable enzymes, which will facilitate combination of the 
nucleotides in the proper manner to form the primer exten- 
sion products which are complementary to each nucleic acid 

5 strand. Generally, the synthesis will be initiated at the 3' end 
of each primer and proceed in the 5' direction along the 
template strand, until synthesis terminates, producing mol- 
ecules of different lengths. There may be inducing agents, 
however, which initiate synthesis at the 5* end and proceed 

10 in the above direction, using the same process as described 
above. 

The inducing agent also may be a compound or system 
which will function to accomplish the synthesis of RNA 
primer extension products, including enzymes. In preferred 

15 embodiments, the inducing agent may be a DNA-dependent 
RNA polymerase such as T7 RNA polymerase. T3 RNA 
polymerase or SP6 RNA polymerase. These polymerases 
produce a complementary RNA polynucleotide. The high 
turn over rate of the RNA polymerase amplifies the starting 

20 polynucleotide as has been described by Chamber iin et al.. 
The Enzymes. ed_ P. Boyer, PP. 87-108, Academic Press, 
New York (1982). Another advantage of T7 RNA poly- 
merase is that mutations can be introduced into the poly- 
nucleotide synthesis by replacing a portion of cDNA with 

25 one or more mutagenic oligodeoxynucleotides 
(polynucleotides) and transcribing the partially-mismatched 
template directly as has been previously described by Joyce 
et aL. Nuc. Acid Res^ 17:711-722 (1989). Amplification 
systems based on transcription have been described by 

30 Gingeras et al., in PCR Protocols, A Guide to Methods and 
Applications, pp 245-252. Academic Press. Inc.. San Diego. 
Calif. (1990). 

If the inducing agent is a DNA-dependent RNA poly- 
merase and therefore incorporates ribonucleotide 

35 triphosphates, sufficient amounts of ATP. CTP. GTP and 
UTP are admixed to the primer extension reaction admixture 
and the resulting solution is treated as described above. 

The newly synthesized strand and its complementary 
nucleic acid strand form a double-stranded molecule which 

40 can be used in the succeeding steps of the process. 

The first and/or second PCR reactions discussed above 
can advantageously be used to incorporate into the receptor 
a preselected epitope useful in immunologically detecting 
and/or isolating a receptor. This is accomplished by utilizing 

45 a first and/or second polynucleotide synthesis primer or 
expression vector to incorporate a predetermined amino acid 
residue sequence into the amino acid residue sequence of the 
receptor. 

After producing and V^-coding DNA homologs for a 

50 plurality of different and V^-coding genes within the 
repertoires, the DNA molecules are typically further ampli- 
fied. While the DNA molecules can be amplified by classic 
techniques such as incorporation into an autonomously 
replicating vector, it is preferred to first amplify the mol- 

55 ecules by subjecting them to a polymerase chain reaction 
(PCR) prior to inserting them into a vector. PCR is typically 
carried out by thermocyding i.e.. repeatedly increasing and 
decreasing the temperature of a PCR reaction admixture 
within a temperature range whose lower limit is about 10° C. 

60 to about 40° C. and whose upper limit is about 90° C. to 
about 100° C. The increasing and decreasing can be 
continuous, but is preferably phasic with time periods of 
relative temperature stability at each of temperatures favor- 
ing polynucleotide synthesis, denaturation and hybridiza- 

65 tion. 

PCR amplification methods are described in detail in U.S. 
Pat Nos. 4.683.192. 4.683.202, 4.800.159. and 4.965.188. 
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and at least in several texts including "PCR Technology: 
Principles and Applications for DNA Amplification". H. 
Erlich. ed.. Stockton Press. New York (1989); and "PCR 
Protocols: A Guide to Methods and Applications". Innis et 
al.. eds.. Academic Press, San Diego. Calif. (1990). 

In preferred embodiments only one pair of first and 
second primers is used per amplification reaction. The 
amplification reaction products obtained from a plurality of 
different amplifications, each using a plurality of different 
primer pairs, are then combined. 

However, the present invention also contemplates DNA 
homolog production via co-amplification (using two pairs of 
primers), and multiplex amplificatioD (using up to about 8. 
9 or 10 primer pairs). 

In preferred embodiments, the PCR process is used not 
only to produce a library of DNA molecules, but also to 
induce mutations within the library or to create diversity 
from a single parental clone and thereby provide a library 
having a greater heterogeneity. First, it should be noted that 
the PCR process itself is inherently mutagenic due to a 
variety of factors well known in the art. Second, in addition 20 
to the mutation inducing variations described in the above 
referenced U.S. Pat No. 4.683,195. other mutation inducing 
PCR variations can be employed. For example, the PCR 
reaction admixture, can be formed with different amounts of 
one or more of the nucleotides to be incorporated into the 25 
extension product. Under such conditions, the PCR reaction 
proceeds to produce nucleotide substitutions within the 
extension product as a result of the scarcity of a particular 
base. Similarly, approximately equal molar amounts of the 
nucleotides can be incorporated into the initial PCR reaction 30 
admixture in an amount to efficiently perform X number of 
cycles, and then cycling the admixture through a number of 
cycles in excess of X, such as. for instance. 2X. 
Alternatively, mutations can be induced during the PCR 
reaction by incorporating into the reaction admixture nucle- 35 
otide derivatives such as inosine. not normally found in the 
nucleic acids of the repertoire being amplified. During 
subsequent in vivo amplification, the nucleotide derivative 
will be replaced with a substitute nucleotide thereby induc- 
ing a point mutation. <w 

5. Linear DNA Expression Vectors 
A DNA expression vector for use in a method of the 

invention for producing a library of DNA molecules is a 
linearized DNA molecule as described before having two 
(upstream and downstream) cohesive termini adapted for 45 
directional ligation to a polypeptide gene. 

A linear DNA expression vector is typically prepared by 
restriction endonuclease digestion of a circular DNA expres- 
sion vector of this invention to cut at two preselected 
restriction sites within the sequence of nucleotides of the 50 
vector adapted for directional ligation to produce a linear 
DNA molecule having the required cohesive termini that are 
adapted for direction ligation. Directional ligation refers to 
the presence of two (a first and second) cohesive termini on 
a vector, or on the insert DNA molecule to be Ugated into the 55 
vector selected, so that the termini on a single molecule are 
not complementary. A first terminus of the vector is comple- 
mentary to a first terminus of the insert and the second 
terminus of the vector is complementary to the second 
terminus of the insert 60 

6. Ligation Reactions to Produce Gene Libraries 
In preparing a library of DNA molecules of this invention, 

a ligation admixture is prepared as described above, and the 
admixture is subjected to ligation conditions for a time 
period sufficient for the admixed repertoire of polypeptide 65 
genes to ligate (become operatively linked) to the plurality 
of DNA expression vectors to form the library. 



Ligation conditions are conditions selected to favor a 
ligation reaction wherein a phosphodiester bond is formed 
between adjacent 3' hydroxy! and 5* phosphoryl termini of 
DNA. The ligation reaction is preferably catalyzed by the 
5 enzyme T4 DNA ligase. Ligation conditions can vary in 
time, temperature, concentration of buffers, quantities of 
DNA molecules to be Ugated. and amounts of ligase, as is 
well known. Preferred ligation conditions involve maintain-, 
ing the ligation admixture at 4 degrees Centigrade (4° C.) to 
t 12° C. for 1 to 24 hours in the presence of 1 to 10 units of 
1 T4 DNA ligase per milliliter (ml) and about 1 to 2 micro- 
grams (ug) of DNA. Ligation buffer in a ligation admixture 
typically contains 0.5M Tris-HCl (pH 7.4). 0.01M MgCl 2 . 
0.01M dithiothrietol. 1 mM spermidine. 1 mM ATP and 0.1 
mg/ml bovine serum albumin (BSA). Other ligation buffers 
can also be used. 

Exemplary ligation reactions are described in Example 2. 
7. Preparation of Dicistronic Gene Libraries 

In a particularly preferred erarxxliment the present inven- 
tion contemplates methods for the preparation of a library of 
dicistronic DNA molecules. A dicistronic DNA molecule is 
a single DNA molecule having the capacity to express two 
separate polypeptides from two separate cistrons. In pre- 
ferred embodiments, the two cistrons are operativeiy linked 
at relative locations on the DNA molecule such that both 
cistrons are under the transcriptional control of a single 
promoter. Each dicistronic molecule is capable of expressing 
first and second polypeptides from first and second cistrons. 
respectively, that can form, in a suitable host, a het- 
erodimeric receptor on the surface of a filamentous phage 
particle. 

The method for producing a library of dicistronic DNA 
molecules comprises the steps of: 

(a) Forming a first ligation admixture by combining in a 
ligation buffer 

(i) a repertoire of first polypeptide genes in the form of 
dsDNA. each having cohesive termini adapted for 
directional ligation, and 

(ii) a plurality of DNA expression vectors in linear 
form, each having upstream and downstream first 
cohesive termini that are (a) adapted for directionally 
receiving the first polypeptide genes in a common 
reading frame, and (b) operatively linked to respec- 
tive upstream and downstream translatable DNA 
sequences. The upstream translatable DNA sequence 
encodes a pelB secretion signal, the downstream 
translatable DNA sequence encodes a filamentous 
phage coat protein membrane anchor, and translat- 
able DNA sequences are operatively linked to 
respective upstream and downstream DNA expres- 
sion control sequences. 

(b) Subjecting the admixture to ligation conditions for a 
time period sufficient to operativeiy link the first 
polypeptide genes to the vectors and produce a plurality 
of circular DNA molecules each having a first cistron 
for expressing the first polypeptide. 

(c) Treating the plurality of circular DNA molecules under 
DNA cleavage conditions to produce a plurality of 
DNA expression vectors in linear form that each have 
upstream and downstream second cohesive terrnini that 
are (i) adapted for directionally receiving a repertoire of 
second polypeptide genes in a common reading frame, 
and (ii) operatively linked to respective upstream and 
downstream DNA sequences. The upstream DNA 
sequence is a translatable sequence encoding a secre- 
tion signal, the downstream DNA sequence has at least 
one stop codon in the reading frame, and the translat- 



5/75 

29 

able DNA sequence is operatively linked to a DNA 
expression control sequence. 

(d) Forming a second ligation admixture by combining in" 
a ligation buffer: 

(i) the plurality of DNA expression vectors formed in 
step (c). and 

(ii) the repertoire of second polypeptide genes in the 
form of dsDNA. each having cohesive termini 
adapted for directional ligation to the plurality of 
DNA expression vectors; and 

(e) Subjecting the second admixture to ligation conditions 
for a time period sufficient to operatively link the 
second polypeptide genes to said vectors and produce 
a plurality of circular DNA molecules each having the 
second cistron for expressing the second polypeptide, 
thereby forming the library. In preferred embodiments 
a secretion signal is a pelB secretion signal, and the 
membrane anchor is derived from cpVm as described 
herein. 

DNA expression vectors useful for practicing the above 
method are the dicistronic expression vectors described in 
greater detail before. 

In practicing the method of producing a library of dicis- 
tronic DNA molecules, it is preferred that the upstream and 
downstream first cohesive termini do not have the same 
nucleotide sequences as the upstream and downstream sec- 
ond cohesive termini. In this embodiment, the treating step 
(c) to linearize the circular DNA molecules typically 
involves the use of restriction endonucleases that are specific 
for producing said second termini, but do not cleave the 
circular DNA molecule at the sites that formed the first 
termini. Exemplary and preferred first and second termini 
are the termini defined by cleavage of pCBAK8 with Xho I 
and Spe I to form the upstream and downstream first termini 
and defined by cleavage of pCBAK8 with Sac I and Xba I 
to form the upstream and downstream second termini. In this 
embodiment, other pairs of cohesive termini can be utilized 
at the respective pairs of first and second termini, so long as 
the four termini are each distinct, non-complementary ter- 
mini. 

Methods of treating the plurality of circular DNA mol- 
ecules under DNA cleavage conditions to form linear DNA 
molecules are generally well known and depend on the 
nucleotide sequence to be cleaved and the mechanism for 
cleavage. Preferred treatments involve admixing the DNA 
molecules with a restriction endonudease specific for a 
endonuclease recognition site at the desired cleavage loca- 
tion in an amount sufficient for the restriction endonuclease 
to cleave the DNA molecule. Buffers, cleavage conditions, 
and substrate concentrations for restriction endonuclease 
cleavage are well known and depend on the particular 
enzyme utilized. Exemplary restriction enzyme cleavage 
conditions are described in Example 2. 

F. Phage Libraries 

The present invention contemplates a library of DNA 
molecules that each encode a fusion protein of this invention 
where the library is in the form of a population of different 
filamentous phage particles each containing one of the 
different rDNA molecules. By different fiSA molecule is 
meant rDNA molecules differing in nucleotide base 
sequence encoding a polypeptide of this invention. 

Thus, a phage library is a population of filamentous 
phage, preferably fl. fd or MD filamentous phage, each 
phage having packaged inside the particle a rDNA expres- 
sion vector of this invention. A preferred library is com- 
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prised of phage particles containing DNA molecules that 
encode at least 10 6 . preferably 10 y and more preferably 
10 8- * different fusion proteins of this invention. By different 
fusion proteins is meant fusion proteins differing in amino 

5 acid residue sequence. Where the packaged expression vec- 
tor encodes first and second polypeptides of an autogenously 
assembling receptor, e.g. V H and V L polypeptides that form 
a Fab. the library can also be characterized as containing or 
expressing a multiplicity of receptor specificities. Thus. 

10 preferred libraries express at least 10 s . preferably at least 10 6 
and more preferably at least 10 7 different receptors, such as 
different antibodies. T cell receptors, in tc grins and the like. 

As described herein, a particular advantage of a filamen- 
tous phage in the present invention is that the DNA molecule 

1 5 present in the phage particle and encoding one or both of the 
members of the heterodimeric receptor can be segregated 
from other DNA molecules present in the library on the basis 
of the presence of the particular expressed fusion protein the 
surface of the phage particle. 

20 Isolation (segregation) of a DNA molecule encoding one 
or both members of a heterodimeric receptor is conducted by 
segregation of the filamentous phage particle containing the 
gene or genes of interest away from the population of other 
phage particles comprising the library. Segregation of phage 

25 particles involves the physical separation and propagation of 
individual phage particles away from other particles in the 
library. Methods for physical separation of filamentous 
phage particles to produce individual particles, and the 
propagation of the individual particles to form populations 

30 of progeny phage derived from the individual segregated 
particle are well known in the filamentous phage arts. 

A preferred separation method involves the identification 
of the expressed heterodimer on the surface of the phage 

35 particle by means of a ligand binding specificity between the 
phage particle and a preselected ligand. Exemplary and 
preferred is the use of 'Spanning" methods whereby a sus- 
pension of phage particles is contacted with a solid phase 
ligand (antigen) and allowed to specifically bind (or iramu- 

^ noreact where the heterodimer includes an immunoglobulin 
variable domain). After binding, non-bound particles are 
washed off the solid phase, and the bound phage particles are 
those that contain ligand-specific heterodimeric receptor 
(heterodimer) on their surface. The bound particles can then 

45 be recovered by elution of the bound particle from die solid 
phase, typically by the use of aqueous solvents having high 
ionic strength sufficient to disrupt the receptor-ligand bind- 
ing interaction. 
An alternate method for separating a phage particle based 

50 on the ligand specificity of the surface-expressed het- 
erodimer from a population of particles is to precipitate the 
phage particles from the solution phase by aosslinkage with 
the ligand. An exemplary and preferred crosslinking and 
precipitation method is described in detail in Example 4c. 

55 The use of the above particle segregation methods pro- 
vides a means for screening a population of filamentous 
phage particles present in a phage library of this invention. 
As applied to a phage library, screening can be utilized to 
enrich the library for one or more particles that express a 

60 heterodimer having a preselected ligand binding specificity. 
Where the library is designed to contain multiple species of 
heterodimers that all have some detectable measure of 
ligand binding activity, but differ in protein structure, 
antigenicity, ligand binding affinity or avidity, and the like, 

65 the screening methods can be utilized sequentially to first 
produce a library enriched for a preselected binding 
specificity, and then to produce a second library further 
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enriched by further screening comprising one or more 
isolated phage particles. Methods for measuring ligand 
binding activities, antigenicity and the like interactions 
between a ligand and a receptor arc generally well known 
and are not discussed further as they arc not essential 
features of the present invention. 

Thus, in one embodiment, a phage library is a population 
of particles enriched for a preselected ligand binding speci- 
ficity. 

In another embediment. a phage library comprises a 
population of particles wherein each particle contains at 
least one fusion protein of this invention on the surface of 
the phage particle. The actual amount of fusion protein 
present on the surface of a phage particle depends, in part, 
on the choice of coat protein membrane anchor present in the 
fusion protein. Where the anchor is derived from cpIDL there 
are typically about 1 to 4 fusion proteins per phage particle. 
Where the anchor is derived from the more preferred cpVTH 
there is the potential for hundreds of fusion proteins on the 
particle surface depending on the growth conditions and 
other factors as discussed herein. Preferably, a phage particle 
in a library contains from about 10 to about 500 cpVTH- 
derived fusion proteins on the surface of each particle, and 
more preferably about 20 to 50 fusion proteins per particle. 
Exemplary amounts of surface fusion protein are shown by 
the electron micrographs described in Example 4a that 
describe particles having about 20 to 24 cpVTH-derived 
fusion proteins per particle. 

In another embodiment, the present invention contem- 
plates a population of phage particles that are the progeny of 
a single particle, and therefor all express the same het- 
erodimer on the particle surface. Such a population of phage 
are homogeneous and clonally derived, and therefore pro- 
vide a source for expressing large quantities of a particular 
fusion protein. An exemplary clonally homogeneous phage 
population is described in Example 4. 

A filamentous phage particle in a library of this invention 
is produced by standard filamentous phage particle prepa- 
ration methods and depends on the presence in a DNA 
expression vector of this invention of a filamentous phage 
origin of replication as described herein to provide the 
signals necessary for (1) production of a single-stranded 
filamentous phage replicative form and (2) packaging of the 
replicative form into a filamentous phage particle. Such a 
DNA molecule can be packaged when present in a bacterial 
cell host upon introduction of genetic compieiuentatioo to 
provide the filamentous phage proteins required for produc- 
tion of infectious phage particles. A typical and preferred- 
method for genetic complementation is to infect a bacterial 
host cell containing a DNA expression Vector of this inven- 
tion with a helper filamentous phage, thereby providing the 
genetic elements required for phage particle assembly. 
Exemplary helper rescue methods are described herein at 
Example 2, and described by Short et al; Nuc. Acids Res., 
16:7583-7600 (1988). 

The level of heterodimeric receptor captured on the 
surface of a filamentous phage particle during the process of 
phage particle extrusion from the host cell can be controlled 
by a variety of means. In one embodiment, the levels of 
fusion proteins are controlled by the use of strong promoters 
in the first and second cistrons for expressing the 
polypeptides, such that transcription of the ftision protein 
cistrons occurs at a relative rate greater than the rate of 
transcription of the cpVUI gene on the helper phage. In 
another embodiment, the helper phage can have an amber 
mutation in the gene for expressing cpVTH, such that less 
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wild-type cpVUI is transcribed in the host cell than fusion 
proteins, thereby leading to increased ratios of fusion protein 
compared to cpVUI during the extrusion process. 

In another embodiment, the amount of helerodimeric 

5 receptor on the phage particle surface can be controlled by 
controlling the timing between expression of fusion proteins 
and the superinfection by helper phage. After introduction of 
the expression vector, longer delay times before the addition 
of helper phage will allow for increased accumulation of the 

10 fusion proteins in the host cell. 

EXAMPLES 

The following examples are intended to illustrate, but not 
15 limit, the scope of the invention. 

1. Construction of a Dicistronic Expression Vector 
for Producing a Heterodimeric Receptor on Phage 
Particles 

20 * To obtain a vector system for generating a large number 
of Fab antibody fragments that can be screened directly, 
expression libraries in bacteriophage Lambda have previ- 
ously been constructed as described in Huse et al.. Science, 
246:1275-1281 (1989). These systems did not contain 

25 design features that provide for the expressed Fab to be 
targeted to the surface of a filamentous phage particle. 
* The main criterion used in choosing a vector system was 
the necessity of generating the largest number of Fab frag- 

30 ments which could be screened directly. Bacteriophage 
Lambda was selected as the starting point to develop an 
expression vector for three reasons. First, in vitro packaging 
of phage DNA was the most efficient method of reintroduc- 
ing DNA into host cells. Second, it was possible to detect 

35 protein expression at the level of single phage plaques. 
Finally, the screening of phage libraries typically involved 
less difficulty with nonspecific binding. The alternative, 
plasmid cloning vectors, are only advantageous in the analy- 
sis of clones after they have been identified. This advantage 

40 was not lost in the present system because of the use of a 
dicistronic expression vector such as pCombVUI. thereby 
permitting a plasmid containing the heavy chain, light chain, 
or Fab expressing inserts to be excised. 

45 a. Construction of Dicistronic Expression Vector 
pCOMB 

(i) Preparation of Lambda Zap™ II 

Lambda Zap™ II is a derivative of the original Lambda 
Zap (ATCC #40.298) that maintains all of the characteristics 

50 of the original Lambda Zap including 6 unique cloning sites, 
fusion protein expression, and the ability to rapidly excise 
the insert in the form of a phagemid (Bluescript SK-), but 
lacks the SAM 100 mutation, allowing growth on many 
Non-Sup F strains, including XLl-Blue. The Lambda Zap™ 

55 H was constructed as described in Short et al. Nuc. Acids 
Res., 16:7583-7600, 1988, by replacing the Lambda S gene 
contained in a 4254 base pair (bp) DNA fragment produced 
by digesting Lambda Zap with the restriction enzyme Nco L 
This 4254 bp DNA fragment was replaced with the 4254 bp 

60 DNA fragment containing the Lambda S gene isolated from 
Lambda gtlO (ATCC #40.179) after digesting the vector 
with the restriction enzyme Nco L The 4254 bp DNA 
fragment isolated from lambda gtlO was ligated into the 
original Lambda Zap vector using T4 DNA ligase and 

65 standard protocols such as those described in Current Pro- 
tocols in Molecular Biology. Ausubel et al.. eds., John Wiley 
and Sons. NY. 1987. to form Lambda Zap™ U. 
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(ii) Preparation of Lambda Hc2 

To express a plurality of V^-coding DNA homologs in an 
£ coli host cell, a vector designated Lambda Hc2 was 
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solution was maintained at 37° C. for 30 minutes and then 
theT4 polynucleotide kinase was inactivated by maintaining 
the solution at 65° C. for 10 minutes. 



TABLE 3 



SEQ. 
ID. NO. 



(22) NI ) 5' GGCCGCAAATTCTAITTCAAGGAGACAGTCAr 3' 

(23) N2) 5' AATCAAAIACClATTGCCTACGGCAGCCGCrGGATT 3' 

(24) N3) 5' CnTAmCTCGCTGCCCAACCAG CCAlG GCCC 3* 

(25) N6) 5* CAGTrTCACCTGGGCCAlXXXntJGTrtXX; 3* 

(26) N7) 5* C AGCGAOTAAIAAC AAJCCA GC GG CTGCCGT AGGC A ATAG 3' 

(27) N8) 5' GTATTTC AITATG ACTCnCTCCTTCAAATAG AATTTGC 3' 

(28) N^4) 5' AGGTGAAACTGCTCGAGATTTCTAGACrAGTrACCCGTAC 3' 

(29) N10-5) 5' CGG AACGTCGTAC GGGTAACTAGTCTAG AAATCTC G AG 3* 

(30) Nil) 5' GACGTTCCGGACTACGGTTCTTAAIAGAAITCG J 

(31) N12) 5' TCGACGAATTCTATIAAGAACCGTAGTC 3' 



constructed. The vector provided the following: the capacity 
to place the V^-coding DNA homologs in the proper reading 
frame; a ribosome binding site as described by Shine et aL. 
Nature, 254:34. 1975; a leader sequence directing the 
expressed protein to the periplasmic space designated the 
pelB secretion signal; a polynucleotide sequence that coded 
for a known epitope (epitope tag); and also a polynucleotide 
that coded for a spacer protein between the V^-coding DNA 
homolog and the polynucleotide coding for the epitope tag. 
Lambda Hc2 has been previously described by Huse et al.. 
Science, 246:1275-1281 (1989). 

To prepare Lambda Hc2. a synthetic DNA sequence 
containing all of the above features was constructed by 
designing single stranded polynucleotide segments of 20-40 
bases that would hybridize to each other and form the double 
stranded synthetic DNA sequence shown in FIG. 3. The 
individual single-stranded polynucleotide segments are 
shown in Table 3. 

Polynucleotides N2, N3, N9^t, NIL N10-5, N6, N7 and 
N8 (Table 3) were kinased by adding 1 ul of each poly- 
nucleotide 0.1 micrograms/microliter (ug/ul) and 20 units of 
T 4 polynucleotide kinase to a solution containing 70 mM 
Tris-HCL pH 7.6, 10 mM MgCl 2 , 5 mM dithiothreitol 
(OTT), 10 mM beta-mercaptoethanol, 500 micrograms per 
milliliter (ug/mi) bovine serum albumin (BSA). The solution 
was maintained at 37 degrees Centigrade (37° C.) for 30 
minutes and the reaction stopped by maintaining the solution 
at 65° C for 10 minutes. The two end polynucleotides, 20 
ng of polynucleotides Nl and polynucleotides N12, were 
added to the above Itinasing reaction solution together with 
Mo volume of a solution containing 20.0 mM Tris-HCL pH 
7.4, 2.0 mM Mgd 2 and 50.0 mM Nad. This solution was 
heated to 70° C. for 5 minutes and allowed to cool to room 
temperature, approximately 25° C. over 1.5 hours in a 500 
ml beaker of water. During this time period all 10 poly- 
nucleotides annealed to form the double stranded synthetic 
DNA insert shown in FIG. 3. The individual polynucleotides 
were covalently linked to each other to stabilize the syn- 
thetic DNA insert by adding 40 ul of the above reaction to 
a solution containing 50 mM Tris-HCL pH 7.5, 7 mM 
MgCl 2 .l mM DTT. 1 mM adenosine triphosphate (ATP) and 
10 units of T4 DNA ligase. This solution was maintained at 
37° C. for 30 minutes and then the T4 DNA ligase was 
inactivated by main tainin g the solution at 65° C. for 10 
minutes. The end polynucleotides were kinased by mixing 
52 ul of the above reaction. 4 ul of a solution containing 10 
mM ATP and 5 units of T4 polynucleotide kinase. This 



20 The completed synthetic DNA insert was ligated directly 
into the Lambda Zap™ n vector described in Example la(t) 
that had been previously digested with the restriction 
enzymes. Not I and Xho 1 The ligation mixture was pack- 
aged according to . the manufacture's instructions using 

25 Gigapack n Gold packing extract available from Stratagene. 
La Jolla, Calif. The packaged ligation mixture was plated on 
XLl-Blue cells (Stratagene). Individual lambda plaques 
were cored and the inserts excised according to the in vivo 
excision protocol for Lambda Zap™ II provided by the 

30 manufacturer (Stratagene). This in vivo excision protocol 
moved the cloned insert from the Lambda Hc2 vector into a 
phagemid vector to allow easy for manipulation and 
sequencing. The accuracy of the above cloning steps was 
confirmed by sequencing the insert using the Sanger dideoxy 

35 method described in by Sanger et al., Proc. Natl. Acad. Sci. 
USA 74:5463-5467. (1977) and using the manufacture's 
instructions in the AMV Reverse Transcriptase 33 S-AXP 
sequencing kit (Stratagene). The sequence of the resulting 
double- stranded synthetic DNA insert in the V„ expression 

40 vector (Lambda Hc2) is shown in FIG. 3. The sequence of 
each strand (top and bottom) of Lambda Hc2 is listed in the 
sequence listing as SEQ. ID. NO. 1 and SEQ. ID. NO. 2, 
respectively. The resultant Lambda Hc2 expression vector is 
shown in FIG. 4. 

45 (iii) Preparation of Lambda Lc2 

To express a plurality of V L -coding DNA homologs in an 
E. coli host cell, a vector designated Lambda Lc2 was 
constructed having the capacity to place the V^-coding DNA 
homologs in the proper reading frame, provided a ribosome 

50 binding site as described by Shine et al.. Nature, 25434 
(1975). provided the pelB gene leader sequence secretion 
signal that has been previously used to successfully secrete 
Fab fragments in E. coli by Lei et al.. / floe, 169:4379 
(1987) and Better et al.. Science, 240:1041 (1988), and also 

55 provided a polynucleotide containing a restriction endonu- 
clease site for cloning. Lambda Lc2 has been previously 
described by Huse et al.. Science, 246:1275-1281 (1989). 

A synthetic DNA sequence containing all of the above 
features was constructed by designing single stranded poly- 

60 nucleotide segments of 20-60 bases that would hybridize to 
each other and form the double stranded synthetic DNA 
sequence shown in FIG. 5. The sequence of each individual 
single-stranded polynucleotide segment (01-08) within the 
double stranded synthetic DNA sequence is shown in Table 

65 4. 

Polynucleotides 02, 03, 04. 05. 06 and 07 (Table 4) were 
kinased by adding 1 ul (0.1 ugful) of each polynucleotide 
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and 20 units of T 4 polynucleotide kinase to a solution 
containing 70 mM Tris-HCl. pH 7.6, 10 mM MgCl. 5 raM 
DTT. 10 mM bcta-mercaptoethanol. 500 mg/ml of BSA. The 
solution was maintained at 37° C. for 30 minutes and the 
reaction stopped by maintaining the solution at 65° C for 10 
minutes. The 20 ng each of the two end polynucleotides. 01 
and 08. were added to the above kinasing reaction solution 
together with Vio volume of a solution containing 20.0 mM 
Tris-HCl. pH 7.4. 2.0 mM MgCl and 15.0 mM sodium 
chloride (NaCl). This solution was heated to 70° C. for 5 
minutes and allowed to cool to room temperature, approxi- 
mately 25° C. over 1.5 hours in a 500 ml beaker of water. 
During this time period all 8 polynucleotides annealed to 
form the double stranded synthetic DNA insert shown in 
FIG. 5. The individual polynucleotides were covalently 
linked to each other to stabilize the synthetic DNA insert by 
adding 40 ul of the above reaction to a solution containing 
50 mi Tris-HCl, pH 7.5, 7 ml MgCl. 1 mm DTT. 1 ram ATP 
and 10 units of T4 DNAligase. This solution was maintained 
at 37° C for 30 minutes and then the T4 DNA ligase was 
inactivated by maintaining the solution at 65° C for 10 
minutes. The end polynucleotides were kinased by mixing 
52 ul of the above reaction. 4 ul of a solution containing 10 
mM ATP and 5 units of T4 polynucleotide kinase. This 
solution was maintained at 37° C. for 30 minutes and then 
theT4 polynucleotide kinase was inactivated by m aintain i ng 
the solution at 65° C. for 10 minutes. 



Shine -Dal gam o ribosome binding site as shown in the 
sequence in FIG. 5 and in SEQ. ID. NO. 3. A Spe I 
restriction site is also present in Lambda Hc2 as shown in 
FIGS. 3 and 4 and in SEQ. ID. NO. 1. A combinatorial 

5 vector, designated pComb. was constructed by combining 
portions of Lambda Hc2 and Lc2 together as described in 
Example la(iv) below. The. resultant combinatorial pComb 
vector contained two Spe I restriction sites, one provided by 
Lambda Hc2 and one provided by Lambda Lc2, with an 

10 EcoR I site in between. Despite the presence of two Spe I 
restriction sites. DNA homologs having Spe I and EcoR I 
cohesive termini were successfully directionally ligated into 
a pComb expression vector previously digested with Spe I 
and EcoR I as described in Example lb below. The prox- 

15 irnity of the EcoR I restriction site to the 3* Spe I site, 
provided by the Lc2 vector, inhibited the complete digestion 
of the 3' Spe I site. Thus, digesting pComb with Spe I and 
EcoR I did not result in removal of the EcoR I site between 
the two Spe I sites. 

20 The presence of a second Spe I restriction site may be 
undesirable for ligations into a pComb vector digested only 
with Spe I as the region between the two sites would be 
eliminated. Therefore, a derivative of Lambda Lc2 lacking 
the second or 3* Spe I site, designated Lambda Lc3. is 

25 produced by first digesting Lambda Lc2 with Spe I to form 
a linearized vector. The ends are filled in to form blunt ends 
which are ligated together to result in Lambda Lc3 lacking 



TABLE 4 



SEQ. 
ID. NO. 



(32) 01) 5' TGAAITCTAAACTAGTCGCCAAGGAGACAGTCAr 3' 

(33) 02) 5" AATGAAATACCnArTGCCTACGGCAGCCGCTGGATT 3' 

(34) 03) 5' GTIATTACTCCXTTOCCCAACCAGarATGCJCC 3' 

(35) 04) 5' GAGCTCGrTCAGTTCTrAGAGTTAAGCGGCCG 3' 

(36) 05) 5' GTATTTC ATTATC ACTCTCTC CTTGGCG ACIAU 1 1 lAGAA- 

TTCAAGCT 3" 

(37) 06) 5' CAGCGAGTAAX\ACAAITXAGCGGCTGCCGTAGGCAAIAG 3* 

(38) 07) 5* TGACGAGCTCGGCCATGGCTGGTKKjG 3* 
p9) 08) 5' TCGACCKXXTGCTIAACTCTAGAAC 3* 



The completed synthetic DNA insert was ligated directly 
into the Lambda Zap™ II vector described in Example 
l(aXi) that had been previously digested with the restriction 
enzymes Sac I and Xho L The ligation mixture was pack- 
aged according to the manufacture's instructions using 
Gigapack II Gold packing extract (Stratagene). The pack- 
aged ligation mixture was plated on XLl-Blue cells 
(Stratagene). Individual lambda plaques were cored and the 
inserts excised according to the in vivo excision protocol for 
Lambda Zap™ II provided by the manufacturer 
(Stratagene). This in vivo excision protocol moved the 
cloned insert from the Lambda Lc2 vector into a plasmid 
phagemid vector allow for easy manipulation and sequenc- 
ing. The accuracy of the above cloning steps was confirmed 
by sequencing the insert using the manufacture's instruc- 
tions in the AMV Reverse Transcriptase 35 S-dATP sequenc- 
ing kit (Stratagene). The sequence of the resulting Lc2 
expression vector (Lambda Lc2) is shown in FIG. 5. Each 
strand is separately listed in the Sequence Listing as SEQ. 
ID. NO. 3 and SEQ. ID. NO. 4. The resultant Lc2 vector is 
schematically diagrammed in FIG. 6. 

A preferred vector for use in this invention, designated 
Lambda Lc3. is a derivative of Lambda Lc2 prepared above. 
Lambda Lc2 contains a Spe I restriction site (ACTAGT) 
located 3' to the EcoR I restriction site and 5' to the 



a Spe I site. Lambda Lc3 is a preferred vector for use in 
constructing a combinatorial vector as described below. 

45 (iv) Preparation of pComb 

Fhagemids were excised from the expression vectors 
Lambda Hc2 or Lambda Lc2 using an in vivo excision 
protocol described above. Double stranded DNA was pre- 
pared from the phagemid-containing cells according to the 

50 methods described by Holmes et aL. Anal. Biochenu, 
114:193 (1981). The phagemids resulting from in vivo 
excision contained the same nucleotide sequences for anti- 
body fragment cloning and expression as did the parent 
vectors, and are designated phagemid Hc2 and Lc2, corre- 

55 spending to Lambda Hc2 and Lc2, respectively. 

For the construction of combinatorial phagemid vector 
pComb, produced by combining portions of phagemid Hc2 
and phagemid Lc2. phagemid Hc2 was first digested with 
Sac I to remove the restriction site located 5* to the LacZ 
promoter. The linearized phagemid was then blunt ended 

60 with T4 polymerase and ligated to result in a Hc2 phagemid 
lacking a Sac I site. The modified Hc2 phagemid and the Lc2 
phagemid were then separately restriction digested with Sea 
I and EcoR I and the linearized phagemids were ligated 
together at their respective cohesive ends. The ligated 

65 phagemid vector was then inserted into an appropriate 
bacterial host and transformants were selected on the anti- 
biotic ampicillin. 



5J1 

37 

Selected ampicillin resistant transformants were screened 
for the presence of two Not I sites. The resulting ampiciliin 
resistant combinatorial phageraid vector was designated 
pComb. the schematic organization of which is shown in 
FIG. 7. The resultant combinatorial vector. pComb. con- 
sisted of a DNA molecule having two cassettes to express 
two fusion proteins and having nucleotide residue sequences 
for the following operativeiy linked elements listed in a 5* to 
3' direction: a first cassette consisting of an inducible LacZ 
promoter upstream from the LacZ gene; a Not I restriction 
site; a ribosome binding site; a pelB leader; a spacer; a 
cloning region bordered by a 5* Xho and 3* Spe I restriction 
site; a decapeptide tag followed by expression control stop 
sequences; an EcoR I restriction site located 5' to a second 
cassette consisting of an expression control ribosome bind- 
ing site; a pclB leaden a spacer region; a cloning region 
bordered by a 5* Sac I and a 3' Xba I restriction site followed 
by expression control stop sequences and a second Not I 
restriction site. 

A preferred combinatorial vector for use in this invention, 
designated pComb2, is constructed by combining portions 
of phagemid Hc2 and phagdmid Lc3 as described above for 
preparing pComb. The resultant combinatorial vector, 
pComb2, consists of a DNA molecule having two cassettes 
identical to pComb to express two fusion proteins identically 
to pComb except that a second Spe I restriction site in the 
second cassette is eliminated. 

b. Construction of Vectors pCombVUI and 
pCombin for Expressing Fusion Proteins Having a 
Bacteriophage Coat Protein Membrane Anchor 

Because of the multiple endonuclease restriction cloning 
sites, the pComb phagemid expression vector prepared 
above is a useful cloning vehicle for modification for the 
preparation of an expression vector of this invention. To that 
end. pComb is digested with EcoR I and Spe I followed by 
phosphatase treatment to produce linearized pComb. 

(i) Preparation of pCombVm 

A PCR product produced in Example 2g and having a 
nucleotide sequence that defines a filamentous bacterioph- 
age coat protein VHI (cpVUT) membrane anchor domain and 
cohesive Spe I and EcoR I termini was admixed with the 
linearized pComb to form a ligation admixture. The cpVIQ- 
membrane anchor-encoding PCR fragment was direction- 
ally ligated into the pComb phagemid expression vector at 
corresponding cohesive termini, that resulted in forming 
pCombVm (also designated pComb8). pCombVUI contains 
a cassette defined by the nucleotide sequence shown in SEQ. 
ID. NO. 116 from nucleotide base 1 to base 208. and 
contains a pelB secretion signal operativeiy linked to the 
cpVTH membrane anchor. 

A preferred phagemid expression vector for use in this 
invention, designated either pComb2-VHl or pComb2-8, is 
prepared as described above by directionally ligating the 
cpVm membrane anchor-encoding PCR fragment into a 
pComb2 phagemid expression vector via Spe I and EcoR I 
cohesive terminii. The pComb2-8 has only one Spe I restric- 
tion site. 

(ii) Preparation of pCombin 

A separate phagemid expression vector was constructed 
using sequences encoding bacteriophage cpm membrane 
anchor domain. A PCR product denning the cpIH membrane 
anchor and Spe I and EcoR I cohesive termini was prepared 
as described for cpVTH. the details of which are described in 
Example 2g. The cpHI- derived PCR product was then 
ligated into linearized pComb vector to form the vector 
pCombm (also designated pComb3). 
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A preferred phagemid expression vector for use in this 
invention, designated either pComb2-III or pComb2-3. is 
prepared as described above by directionally ligating the 
cpHI membrane anchor-encoding PCR fragment into a* 
5 pComb2 phagemid expression vector via Spe I and Spe I 
cohesive terminii. The pComb2-3 has only one Spe I restric- 
tion site. 

c. Construction of pCBAK Vectors Having a 
Chloramphenicol Resistance Marker 

In order to utilize a different selectable marker gene, such 
as chloramphenicol acetyl transferase (CAT)- for the selec- 
tion of bacteria transformed with a vector of this invention, 
expression vectors based on pComb were developed having 
15 a gene encoding CAT and are designated pCBAK vectors. 
The pCBAK vectors are prepared by combining portions of 
pCB and pComb. 

(i) Preparation of pCB 

pBlueScript phagemid vectors. pBC SK(-) and pBS SK(- 

20 ). (Stratagene), were modified and combined to generate a 
third vector designated pCB as described below. 

pBC SK(-), which contains a chloramphenicol resistance 
selectable marker gene, was digested with Bst BI and blunt 
ended with T4 polymerase. A second digestion with Pvu I 

25 allowed for the removal of a 1 kilobase (kb) fragment 
leaving a 2.4 kb linearized vector which retained die CAT 
selectable resistance marker gene, an inducible LacZ pro- 
moter upstream from the LacZ gene and a ColEl origin 
region. The 2.4 kb fragment was recovered. The pBS SK(-) 

30 vector was digested with Aat II and blunt ended with T4 
polymerase. A second digestion with Pvu I allowed for the 
isolation of an 800 base pair (bp) fragment containing the f 1 
origin of replication. Ligation of the pBS derived 800 bp f 1 
fragment with the 2.4 kb pBC fragment created a pCB 

35 precursor vector containing a Sac I site, an fl origin of 
replication- a CAT selectable resistance marker gene, ColEl 
origin, a multiple cloning site (MCS) flanked by T 3 and T 7 
promoters, and an inducible LacZ promoter upstream from 
LacZ gene. 

40 The pCB precursor vector was then digested with Sac I 
and blunt-ended with T4 polymerase. The T4 polymerase - 
treated pCB vector was then religated to form pCB vector 
and is lacking a Sac I site. 

(ii) Preparation of pCBAKO 

45 The pCB vector containing the CAT selectable resistance 
marker gene was digested with Sac II and Apa I and treated 
with phosphatase to prevent religation and to form linearized 
pCB vector. The pComb vector prepared in Example l(aXiv) 
was restriction digested with Sac II and Apa I to release a 

50 fragment containing nucleotide residue sequences starting 5' 
to the LacZ promoter and extending past the 3' end of the 
second Not I site. The Sac II and Apa I pComb DNA 
fragment was then directionally ligated into the similarly 
digested pCB vector to form phageraid expression vector 

55 pCBAKO. Preferred pCBAK expression vectors are con- 
structed with pComb2. The resultant pCBAK expression 
vector contains only one Spe I restriction site. 

(iii) Preparation of pCBAK8 

To prepare a pCBAK-based phagemid expression vector 
60 which encodes a bacteriophage coat protein membrane 
anchor domain in the expressed fusion protein. pCB 
phagemid cloning vector prepared in Example lc(ii) was 
linearized by digestion with Sac n and Apa L The pCom- 
bVUI phagemid expression vector, prepared in Example 
65 lb(i). was restriction digested with Sac II and Apa I to form 
a fragment containing a nucleotide residue sequence starting 
5* to the LacZ promoter and extending past the 3* end of the 
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second Not 1 site. The fragment was directioDaUy iigated 
into the linearized pCB cloning vector to form phagemid 
expression vector pCBAK8. 
(iv) Preparation of pCBAK3 

The phagemid expression vector. pCBAK3. for the 
expression of fusion protein having cpHI membrane anchor 
domains, was similarly constructed by direcrionalry ligating 
the Sac H and Apa I restriction digested fragment from 
pCombHI with Sac H and Apa I linearized pCB cloning 
vector. 

2. Construction of Dicistronic Expression Vectors 
for Expressing Anti-NPN Heterodimer on Phage 
Surfaces 

In practicing this invention, the heavy (Fd consisting of 
V H and C w l) and light (kappa) chains (V^. CJ of antibodies 
are first targeted to the periplasm of E. coli for the assembly 
of heteiodimeric Fab molecules. In order to obtain expres- 
sion of antibody Fab libraries on a phage surface, the 
nucleotide residue sequences encoding either the Fd or light 
chains must be operatively linked to the nucleotide residue 
sequence encoding a filamentous bacteriophage coat protein 
membrane anchor. Two preferred coat proteins for use in this 
invention in providing a membrane anchor are vm and ID 
(cpvm and cpIIL respectively). In the Examples described 
herein, methods for operatively linking a nucleotide residue 
sequence encoding a Fd chain to either cpVTQ or cpEH 
membrane anchors in a fusion protein of this invention are 
described. 

In a phagemid vector, a first and second cistron consisting 
of translatable DNA sequences are operatively linked to 
form a dicistronic DNA molecule. Each cistron in the 
dicistronic DNA molecule is linked to DNA expression 
control sequences for the coordinate expression of a fusion 
protein. Fd-cpVUI or Fd-cpm. and a kappa light chain. 

The first cistron encodes a periplasmic secretion signal 
(pelB leader) operatively linked to the fusion protein, either 
Fd-cpVUI or Fd-cpUL The second cistron encodes a second 
pelB leader operatively linked to a kappa light chain. The 
presence of the pelB leader facilitates the coordinated but 
separate secretion of both the fusion protein and light chain 
from the bacterial cytoplasm into the periplasmic space. 

The process described above is schematically dia- 
grammed in FIG. 8. Briefly, the phagemid expression vector 
carries a chloramphenicol acetyl transferase (CAT) select- 
able resistance marker gene in addition to the Fd-cpVUI 
fusion and the kappa chain. The f 1 phage origin of replica- 
tion facilitates the generation of single stranded phagemid. 
The isopropyl thiogalactopyranoside (IPTG) induced 
expression of a dicistronic message encoding the Fd-cpvm 
fusion (V„. C m * cpVm) and the light chain (V^. CJ leads 
to the formation of heavy and light chains. Each chain is 
delivered to the periplasmic space by the pelB leader 
sequence, which is subsequently cleaved. The heavy chain is 
anchored in the membrane by the cpVTQ membrane anchor 
domain while the light chain is secreted into the periplasm.' 
The heavy chain in the presence of light chain assembles to 
form Fab molecules. This same result can be achieved if. in 
the alternative, the light chain is anchored in the membrane 
via a light chain fusion protein having a membrane anchor 
and heavy chain is secreted via a pelB leader into the 
periplasm. 

With subsequent infection of £. coli with a helper phage, 
as the assembly of the filamentous bacteriophage progresses, 
the coat protein VTO is incorporated along the entire length 
of the filamentous phage particles as shown in FIGS. 8 and 
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9. If cpm is used, the accumulation occurs on the tail of the 
bacteriophage. The advantage of the utilization of mem- 
brane anchors from cpvm over cpm is two fold. Firstly, a 
multiplicity of binding sites, consisting of approximately 

5 2700 cpvm monomers assembled in a tubular array, exist 
along the particle surface. Secondly, the construct does not 
interfere with phage infectivity. 
a. Polynucleotide Selection 

The nucleotide sequences encoding the immunoglobulin 

10 protein CDR*s are highly variable. However, there arc 
several regions of conserved sequences that flank the V 
region domains of either the light or heavy chain, for 
instance, and that contain substantially conserved nucleotide 
sequences, i.e., sequences that will hybridize to the same 

15 primer sequence. Therefore, polynucleotide synthesis 
(amplification) primers t that hybridize to the conserved 
sequences and incorporate restriction sites into the DNA 
homolog produced that are suitable for operatively linking 
the synthesized DNA fragments to a vector were con- 

20 structed. More specifically, the primers are designed so that 
'the resulting DNA homologs produced can be inserted into 
an expression vector of this invention in reading frame with 
the upstream translatable DNA sequence at the region of the 
vector containing the directional ligation means. 

25 (i) V w Primers 

For amplification of the V H domains, primers are 
designed to introduce cohesive termini compatible with 
directional ligation into the unique Xho I and Spe I sites of 
the phagemid Hc2 expression vector. For example, the 3* 

30 primer (primer 12A in Table 5). was designed to be comple- 
mentary to the mRNA in the } H region. In all cases, the 5' 
primers (primers 1-10. Table 5) were chosen to be comple- 
mentary to the first strand cDNA in the conserved 
N- terminus region (antisense strand). Initially amplification 

35 was performed with a mixture of 32 primers (primer 1. Table 
5) that were degenerate at five positions. Hybridoma mRNA 
could be amplified with mixed primers, but initial attempts 
to amplify mRNA from spleen yielded variable results. 
Therefore, several alternatives to amplification using the 

40 mixed 5' primers were compared. 

The first alternative was to construct multiple unique 
primers, eight of which are shown in Table 5. corresponding 
to individual members of the mixed primer pool. The 
individual primers 2-9 of Table 5 were constructed by 

45 incorporating either of the two possible nucleotides at three 
of the five degenerate positions. 

The second alternative was to construct a primer contain- 
ing inosine (primer 10. Table 5) at four of the variable 
positions based on the published work of Takahashi. et al., 

50 Prvc. Natl Acad, Set. (USA.). 82:1931-1935. (1985) and 
Ohtsuka et al.. J. Biol Chetru, 260:2605-2608, (1985). This 
primer has the advantage that it is not degenerate and. at the 
- same time minimizes the negative efFects of mismatches at 
the unconserved positions as discussed by Martin et at, Nuc. 

55 Acids Res. P 13:8927 (1985). However, it was not known if 
the presence of inosine nucleotides would result in incor- 
poration of unwanted sequences in the cloned regions. 
Therefore, inosine was not included at the one position that 
remains in the amplified fragments after the cleavage of the 

60 restriction sites. As a result inosine was not in the cloned 
insert. 

Additional V„ amplification primers including the unique 
3* primer were designed to be complementary to a portion of 
the first constant region domain of the gamma 1 heavy chain 
65 mRNA (primers 16 and 17, Table 5). These primers will 
produce DNA homologs containing polynucleotides coding 
for amino acids from the V H and the first constant region 
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domains of the heavy chain. These DNA homologs can 
therefore be used to produce Fab fragments rather than an 

Additional unique J primers designed to hybridize to ^ 
similar regions of another class of immunoglobulin heavy 
chain such as IgM. IgE and IgA are contemplated. Other 3' 
primers that hybridize to a specific region of a specific class 
of CH l constant region and are adapted for transferring the 
V H domains amplified using this primer to an expression 
vector capable of expressing those V H domains with a 
different class of heavy or light chain constant region are 
also contemplated. 

As a control for amplification from spleen or hybridoma 
mRNA. a set of primers hybridizing to a highly conserved w 
region within the constant region IgG. heavy chain gene 
were constructed. The 5' primer (primer 1L Table 5) is 
complementary to the cDNA in the C H 2 region whereas the 
3* primer (primer 13. Table 5) is complementary to the 
mRNA in the C„3 region. It is believed that no mismatches M 
were present between these primers and their templates. 

The primers used for amplification of heavy chain Fd 
fragments for construction of Fabs are shown at least in 
Table 5. Amplification was performed in eight separate 
reactions, each containing one of the 5* primers (primers 25 
2-9) and one of the 3* primers (primer 16). The remaining 
5* primers that have been used for amplification in a single 
reaction are either a degenerate primer (primer 1) or a primer 
that incorporates inosine at four degenerate positions 
(primer 10. Table 5. and primers 17 and 18, Table 6). The 30 
remaining 3* primer (primer 14. Table 6) has been used to 
construct F v fragments. Many of the 5* primers incorporate 
a Xho I site, and the 3' primers incorporate a Spe I restriction 
site for insertion of the V w DNA homolog into the phagemid 
Hc2 expression vector (FIG. 4). 35 

V H amplification primers designed to amplify human 
heavy chain variable regions are shown in Table 6. One of 
the 5* heavy chain primer contains inosine residues at 
degenerate nucleotide positions allowing a single j*imer to 
hybridize to a large number of variable region sequences. 40 
Primers designed to hybridize to the constant region 
sequences of various IgG mRNAs are also shown in Table 
6. 



(ii) V L Primers 

The nucleotide sequences encoding the V L CDRs are 
highly variable. However, there are several regions of con- 
served sequences that flank the V L CDR domains including 
the J L , framework regions and V L leader/promotor. 
Therefore, amplification primers were constructed that 
hybridized to the conserved sequences and incorporate 
restriction sites that allow cloning the amplified fragments 
into the phagemid Lc2 vector cut with Sac I and Xba L 

For amplification of the V L CDR domains, the 5' primers 
(primers 1-8 in Table 6) were designed to be complementary 
to the first strand cDNA in the conserved N-terminus region. 
These primers also introduced a Sac I restriction endonu- 
clease site to allow the DNA homolog to be doned into 
the phagemid Lc2 expression vector. The 3' V t amplification 
primer (primer 9 in Table 6) was designed to be comple- 
mentary to the mRNA in the } L regions and to introduce the 
Xba.I restriction endonuclease site required to insert the V L 
DNA homolog into the phagemid Lc2 expression vector 
(FIG. 6). 

Additional 3* V L amplification primers were designed to 
hybridize to the constant region of either kappa or lambda 
mRNA (primers 10 and 11 in Table 6). These primers allow 
a DNA homolog to be produced containing polynucleotide 
sequences coding for constant region amino acids of either 
kappa or lambda chain. These primers make it possible to 
produce an Fab fragment rather than an Fyv 

The primers used for amplification of kappa light chain 
sequences for construction of Fabs arc shown at least in 
Table 6. Amplification with these primers was performed in 
5 separate reactions, each containing one of the 5' primers 
(primers 3-6. and 12) and one of the 3' primers (primer 13). 
The remaining 3' primer (primer 9) has been used to 
construct F v fragments. The 5' primers contain a Sac I 
restriction site and the 3* primers contain a Xba I restriction 
site. 

V L amplification primers designed to amplify human light 
chain variable regions of both the lambda and kappa iso- 
types are also shown in Table 6. 

All primers and synthetic polynucleotides described 
herein, including those shown in Tables 3-7 were either 
purchased from Research Genetics in Huntsville. Ala. or 
synthesized on an Applied Biosystems DNA synthesizer, 
model 381 A, using the manufacturer's instruction. 



TABLE 5 



(1) yACK7T(CA3X^A)A<G/A)CT(Ga7nXX}AarC(T/A)GG 3* 

(2) 5'AOCrrCCAGCTrjCTCGAGTCrGG 3' 

(3) 5*AGGTCCAGCTGCTCGAGTCAGO 3' 

(4) 5AGGTCCAGCTTCTCGAGTCTGG 3* 

(5) yAGGTCCAGCTTXTTCGAGTCAGG 3* 

(6) yAGGTCCAACTGCTCGAGTCTGG 3* 

(7) SAGGTCCAACTGCTCGAGTCAGG 3* 

(8) 5AGGTCCAACHTC1CGAGTCTGG 3' 

(9) yAGGTCCAACTTCTCGAGTCAGG 3* 

(10) yAG<jIWANCTNCTCGAGTC(T/A)GO 3" 

(11) SGCCC AAGG ATGTGCTC ACC 3' 



degenerate 5' primer for the amplification 
of mouse and hurp"i heavy chain variable 
regions (Vh) 

Unique S primer for the amplification 

of mouse and human V H 

Unique 5* primer for the amplification 

of mouse and human V H 

Unique 5* primer for the amplification 

of mouse and human V K 

Unique 5' primer for the amplification 

of mouse and human V K 

Unique 5' primer for the amplification 

of mouse and h**^ 0 " V H 

Unique 5* primer for the amplification 

of mouse and hi*™**" V H 

Unique 5* primer for the amplification 

of mouse and h"*"*™ V H 

Unique 5* primer for the amplification 

of mouse and huraw V H 

5* degenerate primer coo raining ioosioc at 

4 degenerate positions for amplification of 

mouse V H 

y primer for amplification in the C H 2 
region of mouse IgGl 
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TABLE 5-continued 



(12) 5'CTAITAGAArrcAACGGIAACAGTGGTGCCTTGGCCCCA 3" 
(12A) yCTATTAACTAGTAACGGTAACAGTGGTGCCTTG CCCCA 3' 

(13) 5 CTCAGTATGGTGGTTGTGC 3' 

(14) 5'GCTACrAGTTTTGATTTCCACCTTGG 3* 

(15) 5*C AGC C ATGGCCGAC ATCC AG ATG 3' 

( 1 6) 5' AATTTTACIAGTC AC CTIGGTGCTGCTGGC 3* 

( 17) 5TATGC AACTAGTAC AACC AC AATCCCTOGGCAC AATTTT 3' 

(18) 5* ACOTITACIACTAC AATCCCTOGGCAC AAT 3* 



3* primer for amplification of V H and 
introducing a 3' Eco RI site 
3' primer for amplification of V H using 3' 
Spe I site 

3' primer for amplification in the C M 3 

region of mouse IgGl 

3' primer for amplification, of mouse kappa 

light chain variable regions (V L ) 

5' primer for amplification of mouse kappa 

light chain variable regions 

Unique 3' primer for amplification of V H 

including part of the mouse gamma 1 first 

constant region 

Unique 3' primer for amplification of Fd 
including part of mouse IgG 1 first constant 
region and hinge region 
3' primer for amplifying mouse Fd including 
part of the mouse IgG first constant region 
and part of the hinge region 



TABLE 6 



(1) y CCAGTTCCGAGCTCCrTTGTGACTCAC<7AATCT 3* 

(2) 3* CCACaTCCGACCTCCn-GTTGACX3CAGCCXX:CC 3* 

(3) 5* ccAGTTCCGAC<rrcxn"c<rrcACC^ r 

(4) 5- CCAGTITCOAGCTCCAGATGACCCAGTCTCCA 3' 

(5) S CCAGATCTTGAGCTCGTGATGACCCAGACrCCA J 

(6) S CCAGATCnX3AC<nXXrrcATGACrCAGTtrrCCA y 

(7) S CCAGATGTGAGCTCTTGATGACCCAAACTCAA 3* 

(8) y C C AG ATGTG AC<rrXXJTG ATAAC CC AGG ATG AA 3* 

(9) S GC AGCATTCTAG AGTTTC AGC TCC AGC TTGC C 3* 

(10) 5' CCGC CGTCTAG AAC ACTC ATTCCTTjTTG AAGCT 3* 

(11) S CCGC CGTCTAG AAC ATTCTGC AGG AG AC AG ACT 3* 

(12) S CCACOTCTGAGCTCGTOATGACACAGTCTCCA 3* 

(13) 5 GCCXXrGTCTAGAAnAACACTCAl iu.10 iTGAA 3' 

(14) S CTATTAACTAGTAACGOtAAO^GT^^ 
-(15) S AGGCTTACIACrrACAATCC CTGGGC AC AAT 3' 

(16) 5* CJCCGCICTAGAACACnCAlTCCTGTTGAA 3' 

(IT) y AGCOTiAICTICTCGAGTCTGC 3' 

(18) y AGGTTlAICnCTCG AGTC AGC 3' 



(19) y GTGCCACATCTCACCTCCTCATGACrc^ 3* 

(20) y TCCTTCTAGATTACTAACACTCTCCCCT 3* 

(21) y GC ATTCTAGACTATTATGAACATTCTCfTAGGGGC 3* 

(22) y CTtXTACAGGGTCCTX3GGCCGAGCTCtjT^ 3* 

(23) y AGTTCCAirn3CTCGAGTCTGG 3" 

(24) y GTGOQC ATGTGTGAG* M b' 1 U' fCACIACriTC<3CXjITITG AGCTC 3* 

(25) 5' ACCATCACIAGTACAAC^XTIGGGCTC 3* 



Unique 5' primer for the amplication 
of kappa light chain variable regions 
Unique 5" primer fbr the amplication 
of kappa light chain variable regions 
Unique y primer for the amplication 
of kappa Ught chain variable regions 
Unique S primer for the amplication 
of kappa Ught chain variable regions 
Unique S primer for the amplication 
of kappa light chain variable regions 
Unique S primer for the amplication 
of kappa light chain variable regions 
Unique 5" primer for the amplication 
of kappa light chain variable regions 
Unique S primer for the amplication 
of kappa light chain variable regions 
Unique 3' primer fbr amplification of 
kappa light chain variable regions 
Unique 3' primer for mouse kappa light 
chain amplification including the 
constant region 

Unique 3' primer for mouse lambda light 
chain amplification including the 
cons tam region 
Unique y primer for V L 
amplification 

Unique 3* primer for amplification of 
kappa light chain 

Unique 3* primer for amplification of 
mouse F v 

Unique 3' primer fbr amplification 

of mouse IgG Fd 

Unique 3' primer for amplification 

of mouse kappa light chain 

Degenerate 5* primer containing 

inosine at 4 degenerate positions for 

amplifying mouse V H 

Degenerate 5' primer containing 

inosine at 4 degenerate positions for 

amplifying mouse V H 

Unique S primer for human and mouse 

kappa V L amplification 

Unique 3' primer for kappa 

V L amplification 

Unique 3* primer fbr human, mouse and 
rabbit lambda V L amplification 
Unique y primer for human lambda 
V L amplification 

5' degenerate primer for human V H 
amplification containing inosine 
at 3 degenerate positions 
Unique 3' primer for human V H 
amplification 

Unique 3* primer for rmman IgGl Fd 
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TABLE 6-coDtinued 



(26) S AGCATCACTAGTACAAGATTTGGGCTC 3' 

(27) 5" AGGTGCAC3CK3CTCGAGTCGGG 3' 

(28) 5* AGGTGCAACTGC1CGAGTCTGG 3' 

(29) 5* AGGTGCAACTGCTCGAGTCGGO 3* 

(30) S TCCTTCTAG ATTACTAAC AC TCTCCC CTGTTG AA 3' 

(31) 5*ctgcacagggtcctgckk:cgagctcgtggtgactcag 3* 

(32) 5 GCATICIAGACTArrAACArrcrtnAGGGGC 3* 

(33) 5' ACCCAAGGACACCCTCATG 3' 

(34) 5* CTCAGTATGCi XUG 1 IttTGC 3' 

(35) 5' GTCTCACTACTCTCC ACCAAGGGCCCATCCGTC 3' 

(36) 5* ATA1ACTAGTGAGACAGTOACCAGGGTTCCTTXX3CCCXIA 3' 

(37) 5* ACGTCTAGATICCACCTTTjGTCCC 3' 

(38) 5* CCAlACX\GTCTATTAACAriXrrGTAGGGGC 3' 

(39) 5* CCCKjAATTCTTATCATTXACCCGGAGA 3* 

(40) 5* TCTX3CACTAGTTGGAAIXXXX:aCATGCAG 3* 



amplification 

Unique 3' primer for amplification 
of human variable regions (V H ) 
Unique 3' primer for amplification 
of human variable regions (V H ) 
Unique 3' primer for amplification 
of human variable regions (V H ) 
Unique 3' primer for amplification 
of human variable regions (V H ) 
3' primer in human kappa light chain 
constant region 

5* primer for amplification of human 
lambda Light chain variable regions 
3' primer in human lambda light chain 
constant region 

Control primer hybridizing to the 

human CH 2 region 

Control primer hybridizing to the 

human CH 3 region 

5' primer for amplifying human IgG 

heavy chain first constant region 

3' primer for amplifying human heavy 

chain variable regions 

3' primer for amplifying h ltT " a n kapp 

chain variable regions 

5' primer for amplifying human kappa 

light chain constant region 

3' primer located in the CH3 region of 

human IgGl to amplify the entire heavy 

chain 

3' primer for aniplifying the Fd region 
of mouse IgM 



The 19 primers listed in Table 5 have been listed in the 
Sequence Listing and have been assigned the following 
SEQ. ID. NO.: 

(I) =SEQ. ID. NO. 40 
<2)=SEQ. ID. NO. 41 
(3>=SEQ. ID. NO. 42 
(4>=SEQ. ID. NO. 43 
(5>=SEQ. ID. NO. 44 
(6>=SEQ. ID. NO. 45 
(7)=SEQ. ID. NO. 46 
(8>=SEQ. ID. NO. 47 
(9>SEQ. ID. NO. 48 
(10)=SEQ. ID. NO. 49 

(II) =SEQ. ID. NO. 50 
(12>=SEQ. ID. NO. 51 
(12A)=SEQ. ID: NO. 52 

(13) =SEQ. ID. NO. 53 

(14) =SEQ. ID. NO. 54 

(15) =SEQ. ID. NO. 55 
(16>SEQ. ID. NO. 56 

(17) =SEQ. ID- NO. 57 

(18) =SEQ. ID. NO. 58 

The 40 primers listed as "(1)" through "(40)" in Table 6 
have also been individually and sequentially listed in the 
Sequence Listing beginning with SEQ. ID. NO. 59 through 
SEQ. ID. NO. 98. respectively. 

b. Preparation of a Repertoire of Genes Encoding Immuno- 
globulin Variable Domain 

Nitrophenylphosphonamidate (NPN) was selected as the 
ligand for receptor binding in preparing a heterodimeric 
receptor according to the methods of the invention. 

Keyhole limpet hemocyanin (KLH) was conjugated to 
NPN to form a NPN- KLH conjugate used for immunizing a 
mouse to produce an anti-NPN immune response and 
thereby provide a source of ligand specific heterodimeric 
receptor genes. 



The NPN-KLH conjugate was prepared by admixing 250 
ul of a solution containing 2.5 mg of NPN in dimethylfor- 
mamide with 750 ul of a solution containing 2 mg of KLH 
in 0.01 Molar (M) sodium phosphate buffer (pH 7.2). The 
two solutions were admixed by slow addition of the NPN 
solution to the KLH solution while the KLH solution was 
being agitated by a rotating stirring bar. Thereafter the 
admixture was maintained at 4° C. for 1 hour'with the same 
agitation to allow conjugation to proceed. The conjugated 
40 NPN-KLH was isolated from the nonconjugated NPN and 
KLH by gel filtration through Scphadex G-25. The isolated 
NPN-KLH conjugate was injected into mice as described 
below. 

The NPN-KLH conjugate was prepared for injection into 

43 mice by adding 100 ug of the conjugate to 250 ul of 
phosphate buffered saline (PBS). An equal volume of com- 
plete Freund's adjuvant was added and emulsified the entire 
solution for 5 minutes. A 129 G /x + mouse was injected with 
300 pi of the emulsion. Injections were given subcutane- 

50 ously at several sites using a 21 gauge needle. A second 
immunization with NPN-KLH was given two weeks later. 
This injection was prepared as follows: 50 micrograms (pg) 
of NPN-KLH were diluted in 250 pi of PBS and an equal 
volume of alum was admixed to the NPN-KLH solution. 

55 The mouse was injected intraperitoneally with 500 ul of the 
solution using a 23 gauge needle. One month later the mice 
were given a final injection of 50 ug of the NPN-KLH 
conjugate diluted to 200 ul in PBS. This injection was given 
intravenously in the lateral tail vein using a 30 gauge needle. 

60 Five days after this final injection the mice were sacrificed 
and total cellular RNA was isolated from their spleens. 

Total cellular RNA was prepared from the spleen of a 
single mouse immunized with KLH-NPN as described 
above using the RNA preparation methods described by 

65 Chomczynski et al.. Anal Biochenu, 162:156-159 (1987) 
and using the RNA isolation kit (Stratagene) according to 
the manufacturer's instructions. Briefly, immediately after 
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removing the spleen from the immunized mouse, the tissue polynucleotides, i.e.. greater than about 10* different 

was homogenized in 10 ml of a denaturing solution con- V^-coding genes, and contains a similar number of 

taining 4.0M guanine isothiocyanate. 0.25M sodium citrate V L -coding genes. Thus, the mRNA population represents a 

at pH 7.0, and 0.1M beta-mercaptoethanol using a glass repertoire of variable region-coding genes, 

homogenizer. One ml of sodium acetate at a concentration of 5 c . Preparation of DNA Homologs 

2M at pH 4.0 was admixed with the homogenized spleen. ^ preparation for PCR amplification. mRNA prepared 

One ml of phenol that had been previously saturated with above is used as a template for cDNA synthesis by a primer 

H 2 0 was also admixed to the denaturing solution attaining extension reaction. In a typical 50 pi transcription reaction, 

the homogenized spleen. Two ml of a chloroform:isoamyl ug ^ spleen mRNA in water is first hybridized 

alcohol (24:1 v/v) mixture was added to this homogenate w (anncaled) with 500 ng (50 .0 pmol) of the 3' V„ primer 

The homogenate was mixed vigorously for ten seconds and ef nA Tab|e 5) at 650 c fof fivc MnlAeSm 

maintained on ice for 15 niinutes The homogenate was then s ;, b ucnU y ^ IldxSm k ^ jusXtd t0 1.5 mM dATR 

transferred to a thick-walled 50 ml propylene centti- dCT? H dGr ^ dTTP. 40 mM Tris-HCl. pH 8.0. 8 mM 

Sp^l^^^ 15 Murine Uukernia vinis Reverse transenptase (S^atagene). 

a fresh 5C ^mi^lypropylene centrifuge tube and mixed with 26 units, is added and the solution is maintained for 1 hour 

an equal volume of isopropyl alcohol. This solution was at 37 C . 

maintained at -20° C. for at least one hour to precipitate the PCR amplification is performed in a 100 pi reaction 

RNA. The solution containing the precipitated RNA was containing the products of the reverse transcription reaction 

centrifuged at 10.000xg for twenty minutes at 4° C. The 20 (approximately 5 ug of the cDNA/RNA hybrid). 300 ng of 

pelleted total cellular RNA was collected and dissolved in 3 3' V w primer (primer 12A of Table 5). 300 ng each of the 5' 

ml of the denaturing solution described above. Three ml of V w primers (primers 2-10 of Table 5) 200 mM of a mixture 

isopropyl alcohol was added to the re-suspended total eel- of dNTP's, 50 mM KC1. 10 mM Tris-HCl pH 8.3. 15 mM 

lular RNA and vigorously mixed. This solution was main- MgCl 2 . 0.1% gelatin and 2 units of The nnus aquatic us (Taq) 

tained at -20° C. for at least 1 hour to precipitate the RNA. 25 DNA polymerase. The reaction mixture is overlaid with 

The solution containing the precipitated RNA was centri- mineral oil and subjected to 40 cycles of amplification. Each 

fuged at lO.OOOxg for ten minutes at 4° C The pelleted RNA amplification cycle includes denaturation at 92° C. for 1 

was washed once with a solution containing 75% ethanoL minute, annealing at 52° C. for 2 minutes and polynucleotide 

The pelleted RNA was dried under vacuum for 15 minutes synthesis by primer extension (elongation) at 72° C. for 1.5 

and then re-suspended in dimethyl pyrocarbonate (DEPQ 30 minutes. The amplified V^coding DNA homolpg contain- 

treated (DEPC-H 2 0) H 2 0. in g samples are then extracted twice with phenol/ 

Messenger RNA (mRNA) enriched for sequences con- chloroform, once with chloroform, ethanol precipitated and 

taining long poly A tracts was prepared from the total arc stored at -70° C. in 10 mM Tris-HG, pH 73. and 1 mM 

cellular RNA using methods described in Molecular Clon- EDTA. 

ing; A Laboratory Mani&LManhitisetai„eds.. Cold 3 3 Using unique 5* primers (2-9. Table 5), efilcicnt 
Harbor, N.Y. (1982). Briefly, one half of the total RNA V^-coding DNAhomolog synthesis and amplification from 
isolated from a single immunized mouse spleen prepared as the spleen mRNA is achieved as shown by agarose gel 
described above was re-suspended in one ml of DEPC-H 2 0 electrophoresis. The amplified cDNA (V^coding DNA 
and maintained at 65° C. for five minutes. One ml of 2xhigh homolog) was seen as a major band of the expected size (360 
salt loading buffer consisting of 100 mM Tris-HCl (Tris 40 bp). The amount the amplified V^oding polynucleotide 
[hydroxymethyt] amino methane hydrochloride), 1M fragment in each reaction is similar, indicating that all of 
sodium chloride (NaCl). 2.0 mM disodium emylene diamine these primers were about equally efBcient in initiating 
tetra-acetic add (EDTA) at pH 7.5 ? and 0.2% sodium amplification. The yield and quality of the amplification with 
dodecyl sulfate (SDS) was added to the re-suspended RNA these primers is reproducible. . 
and the mixture allowed to cool to room temperature. Hie 45 The primer containing inosine also synthesizes amplified 
mixture was then applied to an oligo-dT (Collaborative V^-coding DNA homologs from spleen mRNA 
Research Type 2 or Type 3) column that was previously reproducibly. leading to the production of the expected sized 
prepared by washing the oligo-dT with a solution containing fragment, of an intensity similar to that of the other ampli- 
0.1M sodium hydroxide and 5 mM EDTA and then equili- fied cDNAs. The presence of inosine also permits efBcient 
brating the column with DEPC-H 2 0. The eluate was col- 50 DNA homolog synthesis and amplification, clearly indicat- 
lected in a sterile polypropylene tube and reapplied to the ing that such primers are useful in generating a plurality of 
same column after heating the eluate for 5 minutes at 65° C. V^-coding DNA homologs. Amplification products 
The oligo dT column was then washed with 2 ml of high salt obtained from the constant region primers (primers 11 and 
loading buffer consisting of 50 mM Iris-HCL pH 7.5, 500 13, Table 5) are more intense indicating that amplification 
mM sodium chloride. 1 mM EDTA at pH 7.5 and 0. 1% SDS. 55 was more efficient, possibly because of a higher degree of 
The oligo aT column was then washed with 2 ml of homology between the template and primers. Following the 
lxmedium salt buffer consisting of 50 mM Tris-HCl. pH 7.5. above pr ocedures, a V^coding gene library is constructed 
100 mM. 1 mM EDTA and 0. 1% SDS. The messenger RNA from the products of eight amplifications, each performed 
was eluted from the oligo dT column with 1 ml of buffer with a different 5* prima. Equal portions of the products 
consisting of 10 mM Tris-HCl, pH 7.5, 1 mM EDTA, at pH 60 from each primer extension reaction are mixed and the 
7.5. and 0.05% SDS. The messenger RNA was purified by mixed product is then used to generate a library of 
extracting this solution with phenol/chloroform followed by V^-coding DNA homolog-containing vectors, 
a single extraction with 100% chloroform. The messenger DNA homologs of the are also prepared from the 
RNA was concentrated by ethanol precipitation and purified mRNA prepared as described above. In preparation 
re-suspended in DEPC H^O. 65 for PCR amplification. mRNA prepared according to the 
The messenger RNA (mRNA) isolated by the above above examples is used as a template for cDNA synthesis- 
process contains a plurality of different V w coding In a typical 50 ul transcription reaction, 5-10 ug of spleen 
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raRNA in water is first annealed with 300 ng (50.0 pmol) of 
the 3' primer (primer 14. Table 5). at 65° C. for five 
minutes. Subsequently, the mixture is adjusted to 1.5 mM 
dATP. dCTP, dGTR and tfTTT. 40 mM Tris-HCL pH 8.0. 8 
mM MgCl 2 . 50 mM NaCl. and 2 mM spermidine. Moloney- 
Murine Leukemia virus reverse transcriptase (Stratagene). 
26 units, is added and the solution is maintained for 1 hour 
at 37° C. The PCR amplification is performed in a 100 ul 
reaction containing approximately 5 ug of the cDNA/RNA 
hybrid produced as described above. 300 ng of the 3* V L 
primer (primer 14 of Table 5). 300 ng of the 5* primer 
(primer 16 of Table 5), 200 mM of a mixture of dNTP's. 50 
mM KC1. 10 mM Tris-HCL pH 83. 15 mM MgCl 2 , 0.1% 
gelatin and 2 units of Taq DNA polymerase. The reaction 
mixture is overlaid with mineral oil and subjected to 40 
cycles of amplification. Each amplification cycle includes 
denaturation at 92° C. for 1 minute, annealing at 52° C. for 
2 minutes and elongation at 72° C for 1.5 minutes. The 
amplified samples are then extracted twice with phenol/ 
chloroform, once with chloroform, ethanol precipitated and 
are stored at -70° C. in 10 mM Tris-HCl. 7.5 and 1 mM 
EDTA. 

d. Insertion of DNA Homologs into a DNA Expression 
Vector 

To prepare an expression library enriched in V w 
sequences, DNA homologs enriched in V H sequences are 
prepared according to Example 2c using the same set of 5* 
primers but with primer 12A (Table 5) as the 3' primer. The 
resulting PCR amplified products (2.5 ug/30 ul of 150 mM 
NaCL 8 mM Tris-HCl, pH 7.5, 6 mM MgS0 4 . 1 mM DTT. 
200 ug/ml BSA) are digested at 37° C. with restriction 
enzymes Xho I (125 units) and Spe I (125 units). In cloning 
experiments which required a mixture of the products of the 
amplification reactions, equal volumes (50 ul, 1-10 ug 
concentration) of each reaction mixture are combined after 
amplification but before restriction digestion. The W H 
homologs are purified on a 1% agarose gel using the 
standard electro-elution technique described in Molecular 
Cloning A Laboratory Manual. Maniatis et aL. eds.. Cold 
Spring Harbor. N.Y.. (1982). After gel electrophoresis of the 
digested PCR amplified spleen mRNA. the region of the gel 
containing DNA fragments of approximate 350 bps is 
excised, electro-eluted into a dialysis membrane, ethanol 
precipitated and re-suspended suspended in a TE solution 
containing 10 mM Tris-HCL pH 7.5 and 1 mM EDTA to a 
final concentration of 50 ng/ul. The resulting V H DNA 
homologs represent a repertoire of polypeptide genes having 
cohesive termini adapted for directional ligation to the 
vector Lambda Hc2. These prepared V H DNA homologs are 
then directly inserted by directional ligation into linearized 
Lambda Hc2 expression vector prepared as described below. 

The Lambda Hc2 expression DNA vector is prepared for 
inserting a DNA homolog by admixing 100 ug of this DNA 
to a solution containing 250 units each of the restriction 
endonucleases Xho 1 and Spe I (both from Bochringer 
Mannheim. Indianapolis, Ind.) and a buffer recommended 
by the manufacturer. This solution is maintained at 37 from 
1.5 hours. The solution is heated at 65° C. for 15 minutes top 
inactivate the restriction endonucleases. The solution is 
chilled to 30° C and 25 units of heat-killable (HK) phos- 
phatase (Epicenter. Madison, Wis.) and CaCl 2 is admixed to 
it according to the manufacturer's specifications. This solu- 
tion is maintained at 30° C. for 1 hour. The DNA is purified 
by extracting the solution with a mixture of phenol and 
chloroform followed by ethanol precipitation. The Lambda 
Hc2 expression vector is now ready for ligation to the V H 
DNA homologs prepared in the above examples. These 
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prepared V H DNA homologs are then directly inserted into 
the Xho I and Spe I restriction digested Lambda Hc2 
expression vector that prepared above by ligating 3 moles of 
\ H DNA horaolog inserts with each mole of the Hc2 

5 expression vector overnight at 5° C. Approximately 3-OxlO 3 
plague forming units are obtained after packaging the DNA 
with Gigapack n Bold (Stratagene) of which 50% arc 
recombinants. The ligation mixture containing the V H DNA 
homologs arc packaged according to the manufacturers 

10 specifications using Gigapack Gold II Packing Extract 
(Stratagene). The resulting Lambda Hc2 expression libraries 
are then transformed into XL 1 -Blue cells. 

To prepare a library enriched in sequences, PCR 
amplified products enriched in sequences are prepared 

15 according to Example 2c. These DNA homologs are 
digested with restriction enzymes Sac I and Xba I and the 
digested W L DNA homologs are purified on a 1% agarose gel 
as described above for the V H DNA homologs to form a 
repertoire of ^-polypeptide genes adapted for directional 

20 ligation. The prepared DNA homologs are then direc- 
tionally ligated into the Lambda Lc2 expression vector 
previously digested with the restriction enzymes. Sac I and 
Xba I as described for Lambda Hc2. The ligation mixture 
containing the DNA homologs is packaged to form a 

25 Lambda Lc2 expression library as described above and is 
ready to be plated on XLl-Biue cells, 
e. Randomly Combining V H and DNA Homologs on the 
Same Expression Vector 
The construction of a library containing vectors for 

30 expressing two cistrons that express heavy and light chains 
is accomplished in two steps. In the first step, separate heavy 
and light chain libraries are constructed in the expression 
vectors Lambda Hc2 and Lambda Lc2. respectively, as 
described using gene repertoires obtained from a mouse 

35 immunized with NPN-KLH. In the second step, these two 
libraries are combined at the antisymmetric EcoR I sites 
present in each vector. This resulted in a library of clones 
each of which potentially co-expresses a heavy and a light 
chain. The actual combinations are random and do not 

40 necessarily reflect the combinations present in the B-cell 
population in the parent animal 

The spleen mRNA resulting from the above immuniza- 
tions (Example 2b) is isolated and used to create a primary 
tfbrary of V,, gene sequences using the Lambda Hc2 expres- 

45 sioh vector. The primary library contains 13xl0 6 plaque- 
forming units (pfu) and can be screened for the expression 
of the decapeptide tag to determine the percentage of clones 
expressing V w and C H l (Fd) sequences. The sequence for 
this peptide is only in frame for expression following the 

50 cloning of a Fd (or V„) fragment into the vector. At least 
80% of the clones in the library express Fd fragments based 
on immunodetection of the decapeptide tag. 

The light chain library is constructed in the same way as 
the heavy chain and contains 2.5xl0 6 members. Plaque 

55 screening, using an anti-kappa chain antibody, indicates that 
60% of the library contained express light chain inserts. A 
small percentage of inserts results from incomplete dephos- 
phorylation of vector after cleavage with Sac I and Xba L 
Once obtained, the two libraries are used to construct a 

60 combinatorial library by crossing them at the EcoR I site. To 
accomplish the cross, DNA is first purified from each library. 

The Lambda Lc2 library prepared in Example 2d is 
amplified and 500 ug of Lambda Lc2 expression library 
phage DNA is prepared from the amplified phage stock 

65 using the procedures described in Molecular Cloning: A 
Laboratory Manual. Maniatis et al., eds.. Cold Spring 
Harbor, N.Y. (1982). Fifty ug of this amplified expression 
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library phage DNA is maintained in a solution containing bound NPN. In a typical procedure 30.000 phage were 

100 units of MLu I restriction endonuclease (Boehringer plated on XLl-Blue cells and duplicate lifts with nitrocei- 

Mannheinx Indianapolis, Ind.) in 200 ul of a buffer supplied lulose were screened for binding to NPN coupled to ,25 I 

by the endonuclease manufacturer for 1.5 hours at 37° C. labeled BSA. The BSA was iodinated following the 

The solution is then extracted with a mixture of phenol and 5 Chloramine-T method as described by Bolton et al., 

chloroform. The DNA is then ethanol precipitated and Biochem., 133:529-534 (1973). Duplicate screens of 80,000 

re-suspended in 100 ul of water. This solution is admixed recombinant phage from the light chain library and a similar 

with 100 units of the restriction endonuclease EcoR I number from the heavy chain library did not identify any 

(Boehringer) in a final volume of 200 ul of buffer containing clones which bound the antigen. In contrast, the screen of a 

the components specified by the manufacturer. This solution 10 similar number of clones from the Fab expression library 

is maintained at 37° C. for 1.5 hours and the solution is then identified many phage plaques that bound NPN. This obser- 

extracted with a mixture of phenol and chloroform. The vation indicates that under conditions where many heavy 

DNA was ethanol precipitated and the DNA re-suspended in chains in combination with light chains bind to antigen the 

same heavy or light chains alone do not Therefore, in the 

The Lambda Hc2 expression library prepared in Example 15 case of NPN, it is believed that there are many heavy and 

2d is amplified and 500 tug of Lambda Hc2 expression Light chains that only bind antigen when they are combined 

library phage DNA is prepared using the methods detailed with specific light and heavy chains respectively, 

above. 50 ug of this amplified library phage DNA is main- To assess the ability to screen large numbers of clones and 

tained in a solution containing 100 units of Hind HI restric- obtain a more quantitative estimate of the frequency of 

tion endonuclease (Boehringer) in 200 ul of a buffer supplied 20 antigen binding clones in the combinatorial library, one 

by the endonuclease manufacturer for 1.5 hours at 37° C. million phage plaques were screened and approximately 100 

The solution is then extracted with a mixture of phenol and clones which bound to antigen were identified. For six 

chloroform saturated with 0. 1M Tris-HCl. pH 7.5. The DNA clones which were believed to bind NPN. a region of the 

is then ethanol precipitated and re-suspended in 100 ul of plate containing the six positive and approximately 20 

water. This solution is admixed with 100 units of the 25 surrounding bacteriophage plaques was selected and each 

restriction endonuclease EcoR I (Boehringer) in a final plaque was cored, replated. and screened with duplicate lifts, 

volume of 200 ul of buffer containing the components As expected, approximately one in twenty of the phage 

specified by the manufacturer. This solution is maintained at specifically bound to antigen. Cores of regions of the plated 

37° C. for 1.5 hours and the solution is then extracted with phage believed to be negative did not give positives on 

a mixture of phenol and chloroform. The DNA is ethanol 30 replating. 

precipitated and the DNA re-suspended in TE. Clone 2b. one of the plaques which reacted with NPN. 

The restriction digested Hc2 and Lc2 expression libraries was excised according to an in vivo excision protocol where 

are ligated together. To that end, a DNA admixture consists 200 ul of phage stock and 200 ul of a F+ derivative of 

of 1 ug of Hc2 and 1 ug of Lc2 phage library DNA is XLl-Blue (A^ 100) (Stratagcnc) were admixed with 1 ul 

prepared in a 10 ul reaction using the reagents supplied in a 35 of M13mp8 helper phage (IxlO 10 pfu/miililiter (ml)) and 

ligation kit (Stratagene). The DNA admixture is wanned to maintained at 37° C. for 15 minutes. After a four hour 

45° C for 5 minutes to melt any cohesive termini that may maintenance in Luria-Bertani (LB) medium and heating at 

have reannealed. The admixture is then chilled to 0° C. to 70° C. for 20 minutes to heat kill the XLl-Blue cells, the 

prevent religation. Bacteriophage T4 DNA ligase (0. 1 Weiss phagemids were re-infected into XLl-Blue cells and plated 

units which is equivalent to 0.02 units as determined in an 40 onto LB plates containing ampicillin. This procedure con- 

exonuclease resistance assay) is admixed into the chilled verted the cloned insert from the Lambda Zap II vector into 

DNA solution along with 1 ul of 5 mM ATP and 1 ul a plasmid vector to allow easy manipulation and sequencing 

lOxbacteriophage T4 DNA ligase buffer (lOxbuffer is pre- (Stratagene). The phagemid DNA encoding the V H and part 

pared by admixing 200 mM TYis-HCl. pH 7.6. 50 mM of the W L was then determined by DNA sequencing using the 

Mgcl,.50 mM DTT and 500 ug/ml BSA) to form a ligation 45 Sanger dideoxy method described in Sanger et al.. Proc. 

admixture. After ligation for 16 hr at 4° C. 1 ul of the ligated Natl Acad. Scl, 74:5463-5467 ( 1977) using a Sequenase kit 

the phage DNA is packaged with Gigapack Gold H pack- according to manufacturer's instructions (US Biochemical 

aging extract and plated on XLl-Blue cells prepared accord- Corp.. Cleveland, Ohio). The nucleotide residue sequence of 

ing to the manufacturers instructions to form a Lambda Clone 2b Fd chain is listed in the Sequence Listing as SEQ. 

phage library of dicistronic expression vectors capable of 50 ID. NO. 99. The nucleotide residue sequences of the kappa 

expressing heavy and light chains derived from the NPN- light chain variable and constant regions are listed in the 

immunized mouse. A portion of the clones obtained are used Sequence Listing as SEQ. ID. NO. 100 and SEQ. ID. NO. 

to determine the effectiveness of the combination. 101. respectively. 

f. Selection of Anti-NPN Reactive Heterc«limex-Producing g. Preparation of a DNA Sequence Encoding a Filamentous 

Dicistronic Vectors 55 Phage Coat Protein Membrane Anchor 

The combinatorial Fab expression library prepared above cpVTH Membrane Anchor M13mpl8. a commercially 
in Example 2a was screened to identify clones having available bacteriophage vector (Pharmacia. Piscataway, 
affinity for NPN. To determine the frequency of the phage N.J.), was used as a source for isolating the gene encoding 
clones which co-expressed the light and heavy chain cpVTIL The sequence of the gene encoding the membrane 
fragments* duplicate lifts of the light chain, heavy chain and 60 anchor domain of cpVUI listed in Sequence Listing as SEQ. 
combinatorial libraries were screened as above for light and ID. NO. 102. was modified through PCR amplification to 
heavy chain expression. In this study of approximately 500 incorporate the restriction endonuclease sites. Spe I and 
recombinant phage, approximately 60% co-expressed light EcoR L and two stop codons prior to the EcoR I site. The 
and heavy chain proteins. corresponding amino acid residue sequence of the mem- 
All three libraries, the light chain, the heavy chain and the 65 brane anchor domain of cpVIH is listed as SEQ. ID. NO. 17, 
combinatorial, were screened to determine if they contained To prepare a modified cpVUI, replicative form DNA from 
recombinant phage that expressed antibody fragments which M13mpl8 was first isolated. Briefly, into 2 ml of LB 
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(Luria-Bertani medium). 50 ul of a culture of a bacterial 
strain carrying an F episome (JM107. JM109 or TGI) was 
admixed with a one tenth suspension of bacteriophage 
particles derived from a single plaque. The admixture was 
incubated for 4 to 5 hours at 37° C. with constant agitation. 
The admixture was then centrifuged at 12.000xg for 5 
minutes to pellet the infected bacteria. After the supernatant 
was removed, the pellet was resuspended by vigorous vor- 
texing in 100 ul of ice-cold solution L Solution I was 
prepared by admixing 50 mM glucose. 10 mM E0TA and 25 
mM Tris-HCl. pH 8.0, and autociaving for 15 minutes. 

To the bacterial suspension. 200 ul of freshly prepared 
Solution II was admixed and the tube was rapidly inverted 
five times. Solution II was prepared by admixing 0.2N 
NaOH and 1% SDS. To the bacterial suspension. 150 ul of 
ice-cold Solution in was admixed and the tube was vortexed 
gendy in an inverted position for 10 seconds for to disperse 
Solution HI through the viscous bacterial lysate. Solution in 
was prepared by admixing 60 ml of 5M potassium acetate, 
11.5 ml of glacial acetic acid and 28.5 ml of water. The 
resultant bacterial lysate was then stored onice for 5 minutes 
followed by centrifugation at 12,000xg for 5 minutes at 4° 
C. in a microfuge. The resultant supernatant was recovered 
and transferred to a new tube. To the supernatant was added 
an equal volume of phenol: chloroform and the admixture 
was vortexed. The admixture was then centrifuged at 
12.000xg for 2 minutes in a microfuge. The resultant super- 
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of which are listed in Table 7 below, were used in the PCR 
reaction to amplify the mature gene for cpVIH member 
anchor domain and incorporate the two cloning sites. Spe I 
and EcoR L For the PCR reaction. 2 ul containing 1 
nanogram (ng) of M13mpl8 replicative form DNA was 
admixed with 10 ul of 1 Ox PCR buffer purchased commer- 
cially (Promega Biotech. Madison. Wis.) in a 0.5 ml 
microfuge tube. To the DNA admixture, 8 ul of a 2.5 mM 
j solution of dNTTPs (dATP. dCIP. dGTP. oTTP) was admixed 
to result in a final concentration of 200 raicroraolar (uM). 
Three ul (equivalent to 60 picomoles (pM)) of the 5* forward 
AK 5 primer and 3 ul (60 pM) of the 3* backward AK 6 
primer was admixed into the DNA solution. To the. 

15 admixture, 73 ul of sterile water and 1 ul/5 units of poly- 
merase (Promega Biotech) was added. Two drops of mineral 
oil were placed on top of the admixture and 40 rounds of 
PCR amplification in a thermocycler were performed. The 
amplification cycle consisted of 52° C. for 2 minutes. 72° C. 

20 for 1.5 minutes and 91° C. for 2 minutes. The resultant PCR 
modified cpVTH membrane anchor domain DNA fragment 
from M13mpl8 containing samples were then purified with 
Gene Clean (BIOlOl, La Joila, Calif.), extracted twice with 

25 pheDol/chlorofonru once with chloroform followed by etha- 
nol precipitation and were stored at -70° C. in 10 mM 
Tris-HCl. pH 7.5. and 1 mM EDTA. 

TABLE 7 



SEQ. 

ID NO. Primer 

(103) 1 AK 5 (F) S GIGCCCAGGG/GTUTACXAGTGCTGAGGGTGAC&& 3* 

(104) a AX 6(B) 5 AUIU UAAl ltJ lAlLAOC Ulg jl 3 

(105) 3 Hc3 (F) S AOOTCCAGCTTCTCGACrrCTGG 3' 

(106) 4 AK 7 (B) S GTCACCCTCAGCACTAGTA.CAATCCCTCGGCAC 3" 
(\V7? G-3 (F) S UAUAU UAL'IAU lUCTGGCUUiUUU ll niVAl'ii; 

GTTTGTGAATATCA*A~J : 

(108) * G-3 (B) 5* 11AC1AUC 1 AULA1AA IAAC GG AATACCCAAAA 

GAACTGG 3* ' 

(109) 7 LAC-F 5* IAIUC UUCTAGTAA CACGACAGGrTTCCCGAC 

TGGT 

(H0)° LAC-B S ACCOAGCK XtAATTC GTAATCATGGTC 3' 

F Forward Primer 
B Backward Primer 

'From 5 to 3": the overlapping sequence for C„l 3' end is double underlined: the Spe I 
restriction site sequence is single underlined; the overlapping sequence for cpVTJJ is double 
underlined. 

'EcoR I restriction site sequence is single underlined 
*Xho I restriction site sequence is underlined 

^From 5' to 3: the overlapping sequence for cpVTJI is double underlined; the Spe I restriction 
site sequence is single underlined; the overlapping sequence for C H 1 3' end is double 
underlined. 

*rrom 5' to 3*: Spe I restriction site sequence is single underlined; the overlapping sequence 
with the 5' end of cpm is double underlined 

*From 5' to 3": Nhe I restriction site sequence is single underlined; the overlapping sequence 
with 3* end of cpm is double underlined. 

'From 5" to 3': overlapping sequence with the 3' cod of cpm is double underlined; Nhe I 
restriction sequence begins with the nucleotide residue "G" at position 4 and extends 5 more 
residues = GCTAGC. 

*EcoR I restriction site sequence is single underlined. 



natant was transferred to a new tube and the double-stranded 
bacteriophage DNA was precipitated with 2 volumes of 
ethanol at room temperature. After allowing the admixture 60 
to stand at room temperature for 2 minutes, the admixture 
was centrifuged to pellet the DNA. The supernatant was 
removed and the pelleted replicative form DNA was resus- 
pended in 25 ul of Tris-HO, pH 7.6, and 10 mM EDTA (TE). 

The double-stranded M13mpl8 replicative form DNA 65 
was then used as a template for PCR. Primers, AK 5 (SEQ. 
ID. NO. 103) and AK 6 (SEQ. ID. NO. 104), the sequences 



To verify amplification of the modified cpVUI membrane 
anchor domain, the PCR purified DNA products were elec- 
trophoresed in a 1% agarose geL The expected size of the 
cpvm was approximately 150 base pairs. The area in the 
agarose containing the modified cpVUI DNA fragment was 
isolated from the agarose as described above. The sequence 
of the isolated modified cpVm DNA fragment is listed as 
SEQ. ID. NO. Ill- The isolated cpVUI DNA fragment was 
then admixed with a similarly prepared fragment of modi- 
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fied Fd as described below in Example 2i in order to form 
a DNA segment encoding the fusion protein Fd-cpVm. 

cpm Membrane Anchor: M13mpl8 was also used as a 
source for isolating the gene encoding the membrane anchor 
domain at cpm. the sequence of which is listed in the 
Sequence Listing as SEQ. ID. NO. 112. The amino acid 
residue sequence of membrane anchor domain cpm is listed 
in SEQ. II). NO. 16. M13mpl8 replicative form DNA was 
prepared as described above and used as a template for PCR 
for amplifying the mature gene for cpHI membrane anchor 
domain and incorporating the two cloning sites. Spe I and 
EcoR I. 

The primer pair. G-3(F) (SEQ. ID. NO. 107) and G-3(B) 
(SEQ. ID. NO. 108) listed in Table 7. was used in PCR as 
performed above to incorporate Spe I and Nhe I restriction 
$ites.-The resultant PCR modified cpm DNA fragment was 
verified and purified as described above. The sequence of the 
PCR modified cpm membrane anchor domain DNA frag- 
ment is listed in the Sequence Listing as SEQ. ID. NO. 113. 
A second PCR amplification using the primer pairs. Lac-F 
(SEQ. ID. NO. 109) and Lac-B (SEQ. ID. NO. 110) listed 
in Table 7. was performed on a separate aliquot of M 13mpl 8 
replicative form template DNA. The primers used for this 
amplification were designed to incorporate an overlapping 
sequence with the nucleotides encoding the membrane 
anchor region of cpUL and the adjacent Nhe I site along with 
a sequence encoding a LacZ promoter region 5' to an EcoR 
I restriction site. The reaction and purification of the PCR 
product performed as described above. The sequence of the 
resultant PCR modified cpm DNA fragment having Nho I 
and EcoR I restriction sites is listed in the Sequence Listing 
as SEQ. ID. NO. 114. 

The products of the first and second PCR amplifications 
were then re combined at the nucleotides corresponding to 
cpm membrane anchor overlap and Nhe I restriction site 
and subjected to a second round of PCR using the G3-F 
(SEQ. ID. NO. 107) and Lac-B (SEQ. ID. NO. 110) primer 
pair to form a recombined PCR DNA fragment product 
consisting of the following: a 5' Spe I restriction site; a cpm 
DNA membrane anchor domain beginning at the nucleotide 
residue sequence which corresponds to the amino acid 
residue 198 of the entire mature cpm protein; an endog- 
enous stop site provided by the membrane anchor at amino 
acid residue number 112; a Nhe I restriction site, a LacZ 
promoter sequence; and a 3' EcoR I restriction site. The 
recombined PCR modified cpm membrane anchor domain 
DNA fragment was then restriction digested with Spe I and 
EcoR I to produce a DNA fragment for directional ligation 
into a pComb phagemid expression vector prepared in 
Example la(iv) and to form a pCombm phagemid expres- 
sion vector as described in Example lb(ii). 

h. Isolation of Anti-NPN Coding V„ DNA Segment 

To prepare modified Fd fragments for recombination with 
the PCR modified cpvm membrane anchor domain frag- 
ment to form a Fd-cpVm DNA fusion product PCR ampli- 
fication as described above was performed using Clone 2b, 
prepared in Example 2f. as a template. The primers. Hc3 
(SEQ. ID. NO. 105) and AK 7 (SEQ. ID. NO. 106), the 
sequences of which are listed in Table 7. were used in PCR 
to amplify the Fd portion of the Clone 2b and incorporate 
Xho I and Spe I cloning sites along with a cpvm overlap- 
ping sequence. The amplified PCR modified Fd product was 
purified, electrophoresed and isolated from 1% agarose gels 
as described above. The size of the Fd fragment was 680 
base pairs. 

i. Preparation of a DNA Segment Encoding a Portion of the 
Fusion Protein Fd-cpVm 
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The purified PCR modified Fd DNA fragment containing 
cpvm overlapping nucleotide sequences prepared above 
was then admixed with the PCR modified cpvm membrane 
anchor domain fragment to form an admixture. The frag- 

5 ments in the admixture were allowed to recombine at their 
complementary regions. The admixture containing the 
recombined PCR fragments was then subjected to a second 
round of PCR amplification as described above using the 
end primer pair AK 6 (SEQ. ID. NO. 104) and Hc3 (SEQ. 

10 ED. NO. 105) (Table 7). The corresponding product of the 
PCR amplification was purified and electrophoresed on 
agarose gels as described above. The PCR product was 
determined to be approximately 830 base pairs (Fd=680+ 
150) confirming the fusion of Fd with cpVm. The sequence 

15 of the PCR product linking the Fd sequence with the cpVm 
sequence in frame in a 5* to 3' direction is listed as SEQ. ID. 
NO. 115. The Fd-cpVUI fusion product was then used in 
directional ligations described in Example 2j for the con- 
struction of a pCBAK8-2b dicistronic phagemid expression 

20 vector. 

j. Construction of pCBAK8-2b Dicistronic Expression Vec- 
tor 

To construct a phagemid vector for the coordinate expres- 
sion of a Fd-cpvm fusion protein with kappa light chain, the 

25 PCR amplified Fd-cpVHJ fusion product prepared in above 
in Example 2i was first ligated into Clone 2b phagemid 
expression vector isolated from the NPN combinatorial 
library prepared in Example 2f. For the ligation, the 
Fd-cpvm PCR fusion product was first restriction digested 

30 with Xho I and EcoR L Clone 2b phagemid vector was 
similarly digested resulting in the removal of the cloning and 
decapeptide regions. The digested Fd-cpVm fragment was 
admixed and ligated into the digested Gone 2b at the 
cohesive termini generated by Xho I and EcoR I restriction 

35 digestion. The ligation resulted in operatively linking the 
nucleotide residue sequence encoding the Fd-cpVIQ 
polypeptide fusion protein to a second cassette having the 
nucleotide residue sequences encoding the ribosome binding 
site, a pelB leader sequence and the kappa light chain 

40 already present in Clone 2b to form a dicistronic DNA 
molecule in the original Done 2b phagemid expression 
vector. 

coli. strain TGI. was then transformed with the 
phagemid containing the dicistronic DNA molecule and 

45 transformants were selected on ampicillin as the original 
Clone 2b contained an ampicillin selectable resistance 
marker gene. For high efficiency electro-transformation of 
E. cbti* a 1:100 volume of an overnight culture of TGI cells 
was inoculated into one liter of L-broth (1% Bacto tryptone. 

50 0.5% Bacto yeast extract 0.5% NaCl). The cell suspension 
was maintained, at 37° C. with vigorous shaking to a 
absorbance at 600 nm of 0.5 to 1.0. The cell suspension in 
log phase growth was then harvested by first chilling the 
flask on ice for 15 to 30 minutes followed by centrifugation 

55 in a cold rotor at 4000 g for 15 minutes to pellet the bacteria. 
The resultant supernatant was removed and the bacterial cell 
pellet was resuspended in a total of one liter of cold water 
to form a cell suspension. The centrifugation and resuspen- 
sion procedure was repeated two more times and after the 

60 final centrifugation. the cell pellet was resuspended in 20 ml 
of cold 10% glycerol. The resuspended cell suspension was 
then centrifuged to form a cell pellet. The resultant cell pellet 
was resuspended to a final volume of 2 to 3 ml in cold 10% 
glycerol resulting in a cell concentration of 1 to 3xl0 iO 

65 cells/ml For the electro- transformation procedure. 40 ul of 
the prepared cell suspension was admixed with 1 to 2 ul of 
phagemid DNA to form a cell-phagemid DNA admixture. 
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The resultant admixture was mixed and allowed to sit on ice 
for one minute. An electroporation apparatus, for example a 
Gene Pulsar, was set a 25 uF and 2.5 kV. The pulse controller 
was set to 200 ohms. The cell-DNA admixture was trans- 
ferred to a cold 0.2 cm electroporation cuvette. The cuvette 
was then placed in the chilled safety chamber and pulsed 
once at the above settings. To the pulsed admixture. 1 ml of 
SOC medium was then admixed and the cells were resus- 
pended with a Pasteur pipette (SOC medium was prepared 
by admixing 2% Bacto tryptone. 0.5% Bacto yeast extract. 
10 mM Nad. 2.5 mM KC1. 10 mM MgCl 2 . 10 mM MgS0 4 . 
and 20 mM glucose). The cells suspension was then trans- 
ferred to a 17x100 mm polypropylene tube and main Lai ned 
at 37° C. for one hour. After the maintenance period, the 
transformed TGI cells were then plated on ampicillin LB 
plates for selection of ampicillin resistant colonies contain- 
ing the phagemid which provided the selectable marker 
gene. 

Ampicillin resistant colonies were selected and analyzed 
for the correct insert size and expression of Fab. Briefly. 
DNA rninipreps of selected colonies were prepared for the 
isolation of phagemid DNA. The isolated phagemid DNA 
from each miniprep was restriction digested with Xho I and 
EcoR I and the digests were electrophoresed on a \% 
agarose gel. Gone AK16 was selected as an 830 bp fragment 
was visualized on the gels confirming the insertion of the 
Fd-cpVm PCR fusion product into digested Gone 2b. 

Gone AK16 phagemid was then restriction digested with 
Xho I and Xba I and the nucleotide residue sequence of the 
dicistronic DNA molecule encoding the Fd-cpVHl fusion 
protein, the ribosome binding site and pelB leader sequence 
for expression of the light chain, a spacer region and the 2b 
kappa light chain was isolated by agarose gel electrophore- 
sis. The isolated dicistronic DNA fragment was then ligated 
into a Xho I and Xba I restriction digested pCBAKO 
expression vector prepared in Example lc(ii) to form a 
dicistronic phagemid expression vector designated 
p€BAK8-2b. 

The resultant pCBAK8-2b expression vector consisted of 
nucleotide residue sequences encoding the following ele- 
ments: f 1 filamentous phage origin of replication; a chloram- 
phenicol acetyl transferase selectable resistance marker 
gene; an inducible LacZ promoter upstream from the LacZ 
gene; a multiple cloning site flanked by T3 and T7 poly- 
merase promoters; and the dicistronic DNA molecule (a first 
cassette consisting of a ribosome binding site, a pelB leader, 
and a Fd-cpVITJ DNA fusion product operarively linked to 
a second cassette consisting of a second ribosome binding 
site, a second pelb leader, and a kappa light chain), 
k. Construction of p€BAX3-2b Dicistronic Expression Vec- 
tor 

To construct a phagemid vector for the coordinate expres- 
sion of a Fd-cpHI fusion protein with kappa light chain, the 
PCR amplified and recombined cpHI membrane anchor 
prepared in Example 2g having a 5* Spe I and 3* EcoR I 
restriction site was first directionally ligated into a pComb 
phagemid expression vector prepared in Example la(iv) to 
form a pCombm phagemid vector. See Example lb(ii) for 
details of vector construction. The resultant pCombUI 
phagemid vector was then restriction digested with Sac II 
and Apa I to form an isolated fragment The resultant 
isolated fragment containing the expression control 
sequences and the cpm sequence was then directionally 
ligated into a similarly digested pCBAKO phagemid vector 
prepared in Example lc(ii) to form a pCBAK3 phagemid 
expression vector. This vector lacked Fd and kappa light 
chain sequences. 
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A phagemid expression vector. pCBAK3-2b. for the 
expression of a fusion protein and kappa light chain was then 
constructed. Briefly, the pCBAK3 phagemid expression 
vector prepared above was first digested with Xho I and Spe 

5 I to form a linearized pCBAK3 phagemid expression vector. 
PCR amplified and modified Fd fragment prepared in 
Example 2h containing Xho I and Spe I sites, was subse- 
quently restriction digested with Xho I and Spe L The 
resultant Fd fragment was then directionally ligated via 

10 cohesive termini into the Xho I and Spe I restriction digested 
pCBAK3 phagemid expression vector to form a second 
phagemid expression vector in which the PCR modified Fd 
fragment was operarively linked in-frame to nucleotide 
residue sequences encoding cpm £. coti strain XLl-Blue 

15 (Stratagenc) was then transformed with the above phagemid 
vector containing Fd-cpIQ. Transformants containing the 
Fd-cpHI encoding phagemid were selected on chloram- 
phenicol Phagemid DNA was isolated from chlorampheni- 
col resistant clones and was restriction digested with Sac I 

20 and Xba I to form a linearized phagemid expression vector 
into which a Sac I and Xba I light chain fragment prepared 
below was directionally ligated. 

Phagemid Gone 2b. isolated from the original combina- 
torial library as described in Example 2a. was restriction 

25 digested with Sac I and Xba- 1 to isolate the nucleotide 
residue sequence encoding the kappa light chain. The iso- 
lated kappa light chain sequence was then directionally 
ligated into the Sac I and Xba I restriction digested 
phagemid expression vector prepared above containing 

30 Fd-cpm to form the phagemid expression vector. pCB AK3- 
2b. The resultant vector contained the nucleotide residue 
sequence of a dicistronic DNA molecule for the coordinate 
expression of a Fd-cpHI fusion protein with kappa light 
chain. The resultant phagemid expression vector consisted 

35 of nucleotide residue sequences encoding the following 
elements: fl filamentous phage origin of replication; a 
chloramphenicol acetyl transferase selectable resistance 
marker gene; an inducible LacZ promoter upstream from the 
LacZ gene; a multiple cloning site flanked by T3 and T7 

40 polymerase promoters; and the dicistronic molecule (a first 
cassette consisting of a first ribosome binding site and pelB 
leader operatively linked to Fd-cpIH operarively linked to a 
second cassette consisting of a second LacZ. a second 
ribosome binding site, and a second pelB leader operatively 

45 linked to a kappa light chain). 

XLl-Bluc cells were then transformed with the phagemid 
expression vector p€BAK3-2b. Transformed colonies con- 
taining the chloramphenicol resistant phagemids were 
selected as described above and analyzed for the correct size 

50 insert and expression of Fab as described in Example 2j. 
Following verification of the insert and expression of Fab in 
the pCBAK3-2b phagemid vector, XLl-Bluc cells were then 
transformed and induced for the expression of Fab antibod- 
ies as described in Examples 3 and 4. 

55 3. Expression of Anti-NPN Heterodimer on Phage 
Surfaces 

For expression of antibody Fab directed against NPN on 
phage surfaces, XLl-Blue cells were separately transformed 

60 with the phagemid vectors, pCBAK8-2b and pCBAK3-2b. 
prepared in Examples 2j and 2k. respectively. The transfor- 
mants were selected on LB plates containing 30 ug/ral 
chloramphenicol. Antibiotic resistant colonies were selected 
for each phagemid transformation and grown in liquid 

65 cultures at 37° C. in super broth (super broth was prepared 
by admixing the following: 20 g 3 [N-Morpholino] propane- 
sulfonic acid (MOPS); 60 g tryptone; 40 g yeast extract; and 
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2 liter of water: adjust pH to 7.0 with 10 m NaOH) 
containing 30 ug/ral chloramphenicol and 12.5 ug/mi tetra- 
cycline for the respective antibiotic selection of the 
phagemid and the F episome. The antibiotic resistant trans- 
formed XLl-Blue cells were diluted to an optical density 
(OD^ nm) of 0.4 in super broth. The inducer, isopropyl 
thiogalactopyranoside (IFTG). was admixed to the bacterial 
suspension for a final concentration of 1 mM and the 
admixture was maintained at 37° C. for 1 hour to induce the 
expression of the fusion protein and kappa light chain from 
the LacZ promoter. Helper phage, either R408 or VCS M 13 
(Stratagene), was then admixed to the induced bacterial 
suspension at a ratio of 10-20 helper phage to 1 transformed 
bacterial cell to initiate the generation of copies of the sense 
strand of the phagemid DNA. The admixture containing the 
helper phage was then maintained for an additional two 
hours at 37° C. to allow for filamentous bacteriophage 
assembly wherein the expressed anti-NPN Fab antibodies 
fused to either bacteriophage membrane anchor domains of 
cpvni or cpnt were incorporated into surface of the bac- 
teriophage particles. The bacterial suspension was then 
centrifuged resulting in a bacterial cell pellet and a super- 
natant containing phage. The supernatant was removed, 
collected and assayed as described below for the presence of 
functional anti-NPN Fab molecules anchored to the phage 
particles by either cpVTH or cpIIL 

4. Assays for Verifying the Presence and Function 
of Anti-NPN Heterodimer on the Surface of 
Filamentous Phage 

a. Electron Microscopy 

To localize functional Fab molecules, the binding to 
antigen labelled with colloidal gold was studied. Phage 
containing supernatants and bacterial cells prepared in 
Example 3 were spotted on formvar Polysciences. Inc., 
Warrington. Pa.) coated grids affixed onto a solid phase. In 
some experiments grids were coated with cells and infected 
with phage in situ. Subsequently grids were blocked with 
bovine serum albumin (BSA) 1% in PBS at pH 7.2. washed 
and incubated with 2-7 nanometer (nm) colloidal gold 
particles coated with BSA-NPN hapten conjugate for a time 
period sufficient to form a labeled immunoreaction complex. 
The grids were washed to remove excess gold particles and 
negatively stained in uranylacetate and visualized by elec- 
tron microscopy. 

Examination of filamentous phage and permeabilized 
cells producing phage revealed specific labelling of phage or 
exposed bacterial membranes. Phage were observed to con- 
tain 1 to 24 copies of antigen binding sites per particle. 
Neither helper phage alone nor intact E. coli labelled with 
antigen. Background nonspecific binding was very low. 
Filamentous phage particles emerging from the E. coli 
surfaces were labelled with antigen as shown in FIG. 9. 

The generation of a related phage surface expression 
vector utilizing cpIII as a fusion partner with Gone 2b, 
pCBAK3-2b, revealed specific antigen labelling to the 
phage head but not the column. Additionally human anti- 
tetanus Fab expressed as a cpni fusion did not bind to 
BSA-NPN antigen. 

b. Phase Elisa 

Micr ©titration plates were coated with NPN-BSA conju- 
gate (0. i ml, 1 ugtal in 0. 1M Tris-HQ pH 9.2), and blocked 
with 1% BSA in PBS. Serial two fold dilutions of pCBAK8- 
2b derived phage (0.1 ml), prepared in Example 3. were 
added to the pre-coated microtitration plate and incubated 
for 3 hours at ambient temperature or 16 hours at 4° C. The 
plates were washed with PBS and goat anti-kappa al kalin e 
phosphatase conjugate (Fisher Biotech. Pittsburgh, Pa.) 
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added (0.1 ml diluted 1/1000 in PBS containing 0.1% BSA) 
and incubated for 2 hours at room temperature. The plates 
were washed in PBS and substrate added (0. 1 ml, 1 rug/ml 
p-nitrophenylphosphate in 0.1M Tris-HCl. pH 9.5, contain- 

5 ing 50 mM MgCl 2 ). After incubation at 37° C. for signal 
development, the optical densities at 400 nm were deter- 
mined. Competition assays were performed with the addi- 
tion of increasing amounts of free NPN hapten ranging from 
zero up to 5 mgAvell. 

io The ELISA assays confirmed the presence of functional 
antibody Fab. In a two site ELISA on NPN antigen coated 
plates when probed with anti-mouse kappa chain enzyme 
conjugate, phage supernatant generated from helper phage 
infection of cells carrying the pCBAK8-2b construct exhib- 

15 ited expected titration curves with serial two fold dilutions 
of phage containing antibody. The results of the two-site 
ELISA are shown in FIG. 10. For a signal to be generated in 
this assay, the phage particle must (i) have functionally 
associated Fd and kappa chains and (ii) be multivalent 

20 Specificity of the particle was assessed by inhibiting binding 
to the plate in the presence of increasing concentrations free 
hapten. The generated phage particles exhibited binding to 
solid phase of the ELBA and could be inhibited by addition 
of hapten as shown in FIG. 11. Complete inhibition was 

25 achieved when 5 ng of free NPN hapten was used in the 
assay. Helper phage did not give a signal in the ELISA. 
c. Antigen Specific Precipitation of Phage 

Phage supernatant from XLl-Blue was transformed with 
the pCBAKJWb dicistronic expression vector prepared in 

30 Example 3 ( 1 ml) was incubated with BSA-NPN conjugate 
(10 uL 2 mg/ml) for 18 hours at 4° C. The mixture was then 
pelleted by centrifugation at 3000 rpm on a bench top 
centrifuge and the appearance of precipitate noted. Helper 
phage was used as a control. The pellet was washed repeat- 

35 edly in cold PBS (5x3 ml/wash) and then resuspended in LB 
(0.5 ml). The solubilized precipitates were added to fresh 
XLl-Blue cells (0.5 ml of overnight culture), incubated for 
1 hour at 37° C. and aliquots plated out on LB agar 
containing cNoramphenicol (30 Mg/ml). Colonies were 

40 selected randomly. Colony lifts on nitrocellulose were 
treated with lysozyme to digest the cell wall, briefly treated 
with chloroform to breakdown the outer membrane, blocked 
in BSA 1% in PBS and incubated with l25 l labelled BSA- 
NPN antigen. After several washes in PBS (containing 

45 0.05% Tween-20). film was exposed to the washed and dried 
filter overnight at -70° C. and the autoradiographs were then 
developed. 

Precipitates were obtained with antibody containing 
phage but not helper phage in the presence of BSA-NPN. In 
50 addition, the particles retained infectiviry on subsequent 
incubation with bacterial cells carrying the F episome and 
generated 4xl0 5 colonies from a single solubilized precipi- 
tate. 

Additionally. DNA restriction analysis was carried out to 
55 determine the presence of heavy and light chain inserts. 
DNA restriction analysis of the clones revealed the presence 
of a. Xho and Xba I fragment of 1.4 kb as expected for 
Fd-cpVm fusion construct and kappa chain insert. 
These results give additional evidence for antigen speci- 
60 ficity and multi valency. In addition to providing immuno- 
logical parameters, this precipitation offers possibilities for 
facile enrichment of antigen specific phage particles. In 
principle, phage containing specific antibodies can be highly 
enriched by precipitation with antigens (which may be cell 
65 surface markers, viral, bacterial as well as synthetic 
molecules). The washed antigen-antibody. precipitates can 
be solubilized by the addition of excess antigen and viable 
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phage recovered. For the recovery of rare species an immo- 
bilized antigen may be used which opens the way for 
differential affinity eiution. 

In order to demonstrate the utility of immobilized antigen 
for the enrichment of clones of defined binding specificity, 
a panning experiment was performed. An ampicillin resis- 
tant phagemid expressing an anti-tetanus Fab as a cpvm 
fusion was constructed. Rescue of this clone with helper 
phage produced phage encoding the ampicillin resistant 
phagemid which displayed the anti-tetanus Fab on their coat. 
These phage encoding tetanus specificity were admixed with 
NFN hapten encoding phage (1:100) and allowed to bind to 
a microtitration plate coated with tetanus toxoid. Following 
a one hour maintenance period, the plate was washed 
extensively and phage were then eluted with a low pH 
buffer. Infection of XLl-Blue cells in log phase growth and 
subsequent plating of aliquots on ampicillin and chloram- 
phenicol allowed for direct quantitation of enrichment 
Examination of over 1.000 colonies showed that ampicillin 
resistant colonies derived from the eluted phage exceeded 
chloramphenicol resistant colonies by 27 to 1. Therefore, 
panning enriched the phage displaying the anti-tetanus Fab 
by 2700 fold. This result suggests that a clone of defined 
specificity present at one part per million will dominate over 
nonspecific clones following two rounds of panning. 

5. Advantages of Assembling Combinatorial 
Antibody Fab Libraries Along Phage Surfaces 

A powerful technique for generating and selecting com- 
binatorial Fabs. with 10*"* members, is presented. In the 
vector described herein, the restriction cloning sites for 
inserting PCR generated antibody fragments have been 
retained as previously reported for the lambda vector. The 
rescue of the genes encoding the antibody Fd and kappa 
chains is mediated through the utilization of the f 1 origin of 
replication leading to the synthesis and packaging of the 
positive strand of the vector on co-infection with helper 
phage. Since the 'mature' virus particle assembles by incor- 
porating the major coat protein around the single stranded 
DNA as it passes through the inner membrane into the 
periplasmic space, not only does it capture the genetic 
information carried on the phagemid vector but also incor- 
porates several copies of functional Fab along the length of 
the particle. On subsequent infection of hosts cells carrying 
the F episome the phagemid confers resistance allowing 
selection of colonies on the appropriate antibiotic In 
essence, the antigen recognition unit has been linked to 
instructions for its production. 

The full power of the earlier combinatorial system could 
not be fully utilized since screening allowed ready access to 
only about 0.1-1% of the members. In the phagemid/M13 
system similar size libraries are generated and all the mem- 
bers are accessed via affinity selection. Furthermore, unlike 
the lambda vector which generated monovalent Fabs, this 
system generates multivalent particles, thus allowing the 
capture of a wider range of affinities. 

The unique phagemid restriction sites permit the recom- 
bination of Fd and kappa chains allowing chain replacement 
or shuffling. The rescue of filamentous single stranded DNA 
allows rapid sequencing and analysis of me genetic make up 
of the clone of interest. Indeed it can be envisaged that phage 
encoding antibody specificity may be enriched by antigen 
selection prior to DNA sequencing or mutagenesis. The 
option to further develop an iterative process of mutation 
followed by selection may allow the rapid generation of high 
affinity antibodies from germ line sequences. The process 
may be automated. Setting aside the potential of the system 
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to mimic nature, the phagemid/M13 system would allow a 
more complete dissection of the antibody response in 
humans which may yield useful therapeutic and diagnostic 
reagents. 

5 The membrane anchoring of the heavy chain and the 
compartmentalization of the kappa chain in the periplasm is 
the key to expressing this functional dimeric protein. The 
potential of this system is by no means limited to antibodies 
and may be extended to any protein recognition system or 

10 combination of systems containing multiple members. For 
example coupling of ligand and effector systems in a high 
avidity matrix is now possible. In a similar vein a library of 
ligands can be sorted against a library of receptors. 

15 6. Randomized Mutagenesis of the CDR3 Region 
of a Heavy Chain Encoding Tetanus Toxoid 
a. PCR Mutagenesis with Degenerate Oligonucleotides 
To obtain a mutagenized heterodimer of this invention of 

. altered specificity that would no longer recognize TT but 

20 would recognize and specifically bind to a new antigen, a 
method was developed to randomize only the CDR3 region 
of a heavy chain fragment encoded by a known nucleotide 
sequence. This approach is schematically diagrammed in 
FIG. 12 where a representative heavy chain fragment within 

25 a phagemid clone, consisting of alternating framework 
regions (1 through 4) shown by white blocks and comple- 
mentarity determining regions (CDR) (1 through 3) shown 
by cross-hatched blocks and the first constant region (CHI), 
is subjected to two separate rounds of PCR. In the first PCR 

30 amplification reaction, the 5' end of the heavy chain begin- 
ning at framework 1 and extending to the 3* end of frame- 
work 3 is amplified. In the second PCR amplification 
reaction, the CDR3 region is randomly mutagenized shown 
by the black box. This is accomplished through the use of a 

35 pool of oligonucleotide primers synthesized with a degen- 
erate region sandwiched between and contiguous with con- 
served framework 3 and 4 region sequences. The resulting 
amplification products, each having a randomized CDR3 
region, begin at the 3' end of framework 3 and extend to the 

40 3' end of the CHI region. The pool of degenerate oligo- 
nucleotide primers have been designed to result in the 
amplification of products having a 5' end that is comple- 
mentary to and will overlap with the 3' end of the products 
of the first PCR reaction product Thus, the two separate 

45 PCR reaction products are pooled and subjected to a third 
PCR reaction in which the overlapping region between the 
two products is extended to result in heavy chain having a 
randomized CDR3 region. 

A heavy chain DNA template for use in this invention was 

50 available in a clone (a phagemid vector containing heavy 
and light chain fragments) from a human combinatorial 
anti-tetanus toxoid (TT) Fab library. This library was con- 
structed in the pCBAK-3 dicistronic expression vector for 
the expression of a heavy chain-cpEI fusion protein (Fd- 

55 cpTH) and a soluble light chain as described for anti-NPN in 
Example 2k and by Persson et ai. Proc. Natl. Acad. Sci^ 
USA 88:2432-2436 (1992) and Barbas et al., Proc. Natl 
Acad. ScL, USA 88 :7978- 7982 (1992). A clone, hereinafter 
referred to as pCE-TT7E, was expressed as described for 

60 anti-NPN hetercKiirhers on phage surfaces in Example 3 and 
subsequently screened by panning on TT-coated plates as 
described for anti-NPN in Example 4c . Clone pCE-TT7E 
exhibited a towards TT on the order of 10~ 7 M and was 
enriched over nonspecific phage by ltf-f old as described by 

65 Barbas et al., supra, done pCE-TT7E. having both heavy 
and light chain sequences, was used as the template DNA for 
the randomized mutagenesis of the CDR3 region of the 
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heavy chain to alter antigen binding specificity as described One hundred nanograms of gel purified products from the 
herein. The sequence of the heavy chain was determined as first and second PCR reactions were then admixed with 1 ug 
described in Example la(ii). Two separate PCR reactions each of FTX3 and CG1Z oligonucleotide primers as a 
were performed as illustrated in FIG. 12. primer pair in a final PCR reaction to form a complete heavy 
The first PCR reaction resulted in the amplification of the 5 chain fragment by overlap extension as illustrated in FIG. 
region of the heavy chain fragment in the pC3-TT7E clone ^l. The PCR reaction admixture also contained 10 ul 
beginning of framework region 1 and extending to the end lOxPCR buffer. 1 ul Taq polymerase and 8 ul 2.5 mM d 
of framework region 3 which is located 5' to CDR3 which NTP's as described above. The PCR reaction was performed 
is approximately 400 base pairs in length. To amplify this as described above. To obtain sufficient quantities of ampli- 
region. the following primer pairs were used. The 5* anti- 10 fication product. 15 identical PCR reactions were performed 
sense oligonucleotide primer. FT3X. having the nucleotide ^ c resulting heavy chain fragments beginning at frame- 
sequence 5'-G-CAA-TAA-ACC-CTC-ACT-AAA-GGG-3' wor fc i an d extending to the end of CHI and having 
(SEQ ID NO 118). hybridized to the non-coding strand of randomly mutagenized CDR3 regions were approximately 
the heavy chain corresponding to the region 5* of and 79Q base pairs in length. The heavy chain fragment ampli- 
including the beginning of framework 1. The 3' sense 13 fi cau ' on products from the 15 reactions were first pooled and 
oligonucleotide primer. B7EFR3. having the nucleotide tne n gel purified as described above prior to their incorpo- 
sequence 5'-tCT-CGC-ACA-ATA-ATA-CAC-GGC-3' ration into a phagemid library. 
(SEQ ID NO 119), hybridized to the coding strand of the " b phagemid Library Construction 

heavy chain corresponding to the 3' end of the framework-3 The resultant gel purified heavy chain fragments prepared 
region. The oligonucleotide primers were synthesized by 20 j n Example 6a were then digested with the restriction 
Research Genetics (Hunstville. Ala.). The PCR reaction was enzymes. Xho I and Spe L as described in Example 2d. The 
performed in a 100 ul reaction containi ng o ne ug of each of resultant digested heavy chain fragments were subsequently 
oligonucleotide primers FTX3 and B7EFR3. 8 ul 2.5 mM g C j purified prior to insertion into the pC3-TT7E phagemid 
dNTP's (dATP. dCTP, dGTR dTTP). 1 ul Taq polymerase. vector clone which was previously digested with the same 
10 ng of template pCE-TI/E. and 10 ul of lOxPCR buffer 25 restriction enzymes to remove the non- mutagenized heavy 
purchased commercially (Pr omega Biotech). Two drops of cna j D fragment and form a linear vector. Ligation of 640 ng 
mineral oil were placed on top of the admixture and 35 0 f tnc heavy chain Xho I/Spe I fragments having 
rounds of PCR amplification in a thermocycler were per- mutagenized CDR3 regions into two ug of the linearized 
formed. The amplification cycle consisted of denaturing at c pC3-TT7E phagemid vector to form circularized vectors 
94 C. for one minute, annealing at 50° C. for one minute. 30 having mutagenized CDR3 regions was performed over- 
followed by extension at 72° C. for two minutes. The & room temperature using 10 units of BRL Ugase 
resultant PCR amplification products were then gel purified (Gaithersburg. Md.) in BRL ligase buffer in a reaction 
as described in Example Id and used in an overlap extension volume of 150 uL Five separate ligation reactions were 
PCR reaction with the products of the second PCR reaction. performed to increase the size of the phage library having 
both as described below, to recombine the two products into 35 mutagenized CDR3 regions. Following the ligation 
reconstructed heavy chains containing mutagenized CDR3 reactions, the circularized DNA was precipitated at -20° C. 
regions as illustrated in FIG. 12. for two hours by the admixture of 2 ul of 20 mg/ml 
The second PCR reaction resulted in the amplification of glycogen. 15 ul of 3M sodium acetate at pH 5.2 and 300 ul 
the heavy chain from the 3* end of framework region 3 0 f e thanoL DNA was then pelleted by microcentrilugation at 
extending to the end of CHI region which is approximately ^ 40 q f or 15 minutes. The DNA pellet was washed with cold 
390 base pairs in length. To amplify this region, the follow- -jq<% e thanol and dried under vacuum. The pellet was 
ing primer pairs were used. The 5* anti-sense oligonucleotide resuspended in 10 ul of water and transformed by electropo- 
primer pool, designated 7ECDR3. had the nucleotide ration into 300 ul of E. coli XLl-Blue cells as described in 
sequence represented by the formula. Example 2k to form a phage library. The total yield from the 
5'<nr}-Txr-TAT-TCT^^ 45 mutagenesis and transformation procedure described herein 

nns-nns-nns-nns-nns-nns-nns-NNS-nns-nns-NNS- was approximately 5xl0 7 transformants. 

TGG-OGC-c aa-ggg-a CC-ACG-3* After transformation, to isolate phage on which het- 
where N can be A. C. G, or T and S is either C or G (SEQ erodimer expression has been induced for subsequent pan- 
ID NO 120), wherein the 5* end of the primer pool is ning on target antigens such as fluorescein. 3 ml of SOC 
complementary to the 3' end of framework 3 represented by 50 medium (SOC was prepared by admixture of 20 g bacto- 
the complementary nucleotide sequence of the oligonucle- tryptone. 5 g yeast extract and 0.5 g Nad in one liter of 
otide primer B73FR3 and the 3* end of the primer pool is water, adjusting the pH to 7.5 and admixing 20 ml of glucose 
complementary to the 5* end of framework 4. The region just before use to induce the expression of the Fd-cpUI and 
between the two specified ends of the primer pool is reprc- light chain heterodimer) was admixed and the culture was 
sented by a 48-mer NNS degeneracy which ultimately 55 shaken at 220 rpm for one hour at 37 G. after which 10 ml 
encodes a diverse population of mutagenized CDR3 regions of SB (SB was prepared by admixing 30 g tryptone. 20 g 
of 16 amino acid residues in length. The 3* sense oligonucle- yeast extract, and 10 g Mops buffer per liter with pH 
otide primer. CG1Z. as described by Persson et al., supra, adjusted to 7) containing 20 ug/ml carbenicillin and 10. 
having the nucleotide sequence ug/ml tetracycline and the admixture was shaken at 300 rpm 
5'-GCATGTACTAGTTTTGTCACAAGATTTGGG-3' 60 for an additional hour. This resultant admixture was admixed 
(SEQ ID NO 121), hybridized to the coding strand of the to 100 ml SB containing 50 ug/ml carbenicillin and 10 ug/ml 
heavy chain corresponding to the 3' end of the CHI. The tetracycline and shaken for one hour, after which helper 
second PCR reaction was performed on the pC3-TT7E in a phage VCSM13 (10 12 pfu) were admixed and me admixture 
100 ul reaction as described above containing one ug of each was shaken for an additional two hours. After this time. 70 
of oligonucleotide primers 7ECDR3 and CG1Z. The result- 65 ug/ml kanamycin was admixed and maintained at 30° C. 
ant PCR amplification product was then gel purified as overnight. The lower temperature resulted in better het- 
described above. erodimer incorporation on the surface of the phage. The 
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supernatant was cleared by centrifugation (4000 rpra for 15 
minutes in a JA 10 rotor at 4° C). Phage were precipitated by 
admixture of 4% (w/v) polyethylene glycol 8000 and 3% 
(w/v) NaCl and maintained on ice for 30 minutes, followed 
by centrifugation (9000 rpm for 20 minutes in a JA10 rotor 
at 4° C). Phage pellets were resuspcnded in 2 mi of PBS and 
microcentrifuged for three minutes to pellet debris, trans- 
ferred to fresh tubes and stored at -20° C. for subsequent 
screening as described below. 

For determining the titering colony forming units (cfu). 
phage (packaged phagemid) were diluted in SB and 1 ul was 
used to infect 50 ul of fresh (AOD600=1) E. coU XLI-Blue 
cells grown in SB containing 10 ug/ml tetracycline. Phage 
and cells were maintained at room temperature for 15 
minutes and then directly plated on LB/carbenicillin plates, 
c. Selection of Anti-Fluorescein Heterodimers on Phage 
Surfaces 

1) Multiple Pannings of the Phage Library Having 
Mutagenizcd CDR3 Regions 

The phage library produced in Example 6b having heavy 
chain fragments with mutagenized CDR3 regions was 
panned as described herein on a microtiter plate coated with 
a 50 ug/ml fluorescein-BSA conjugate to screen for anti- 
fluorescein heterodimers. Fluorescein was conjugated to 
BSA according to the methods described in "Antibodies: A 
Laboratory Manual", eds Harlow et al.. Cold Spring Harbor 
Laboratory, 1988. 

The panning procedure described was a modification of 
that originally described by Parmley and Smith (Parmley et 
al.. Gene, 73:30-5-318). Two to four wells of a microtiter 
plate (Costar 3690) were coated overnight at 4° C. with 25 
ul of 50 ug/ml antigen prepared above in 0.1M bicarbonate, 
pH 8.6. The wells were washed twice with water and 
blocked by completely filling the well with 3% (w/v) bovine 
serum albumin (BSA) in PBS and incubating the plate at 37° 
C. for 1 hour. Blocking solution was shaken out, 50jUl of the 
phage library prepared above (typically 10 u cfu) was added 
to each well, and the plate was incubated for 2 hours at 37° 
C 

Phage were removed and the plate was washed once with 
water. Each well was then washed 10 times with TBS/Twee n 
(50 mM Tris-HCl. pH 7.5, 150 mM Nad. 0.5% Tween 20) 
over a period of 1 hour at room temperature — pipetted up 
and down to wash the well, each time allowing the well to 
remain completely filled with TBS/Tween between wash- 
ings. The plate was washed once more with distilled water 
and adherent phage were elutcd by the addition of 50 ul of 
elution buffer (0.1M HQ adjusted to pH 22 with solid 
glycine, containing 1 rng/ml BSA) to each well and incu- 
bation at room temperature for 10 minutes. The elution 
buffer was pipetted up and down several times, removed, 
and neutralized with 3 uJ of 2M Iris base per 50 ul of elution 
buffer used. Eluted phage were used to infect 2 ml of fresh 
(00600=1) £. coU XLI-Blue cells for 15 minutes at room 
temperature, after which 10 ml of SB containing 20 ug/ml 
carbenicillin and 10 ug/ml tetracycline was admixed. 
[Aliquot (20, 10. and l /io ul) were removed for plating to 
determine the number of phage (packaged phagemids) that 
were eluted from the plate.) The culture was shaken for 1 
hour at 37° C. after which it was added to 100 ml of SB 
containing 50 ug/ml carbenicillin and 10 Mg/ml tetracycline 
and shaken for 1 hour. Then helper phage VCSM13 (10 12 
pfu) were added and the culture was shaken fox an additional 
2 hours. After this time. 70 ug/ml kanamycin was added and 
the culture was incubated at 37° C. overnight Phage prepa- 
ration and further panning were repeated as described above. 
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Following each round of panning, the percentage yield of 
phage must be determined, where % yield— (number of 
phage elutedVnumber of phage applied) xl00. 

As an alternative to elution with acid, phage bound to the 

5 wells of the microtiter plate were eluted by admixing 50 ul 
of a solution of 10~ 3 M fluorescein diluted in PBS followed 
by a maintenance period of one hour at 37 C. The solution 
was then pipetted up and down to wash the wells. The 
resultant eluate was transferred to 2 ml of fresh E. coU 

io XLI-Blue cells for infection as described above for prepar- 
ing phage and further panning. In subsequent rounds of 
panning, phage were eluted with lCr*M fluorescein. 

The results of the amount of phage that were specifically 
bound to fluoresce in-coated wells over four consecutive 

15 rounds of panning and eluted with acid or with fluorescein 
alone are shown below in Table 8. Comparable yields of 
phage on which heterodimers were expressed that bound 
specifically to fluorescein were achieved with either elution 
protocol Approximately 20 clones of the 5xl0 7 clones 

20 resulting from the mutagenesis and transformation exhibited 
specificity of binding towards fluoresce in-coated wells. 
These data confirm that mutagenesis of the CDR3 region as 
described in this invention resulted in the altering of a 
heterodimer which initially specifically bound to TT to one 

23 that specifically bound fluorescein. 



TABLE 8 



30 



35 





Phajw Eluted 






Acid Elution 


Fluorescein Elunoo 


round 1 


5.6 x lOVwell 


4.7 x ltf/weD 


round 2 


4.6 x lCftweU 


5.6 x lOVwell 


round 3 


3.75 x lOVwell 


1.35 x lOVwel! 


round 4 


131 x itfVwell 


4.0 x 10*/wdl 



2) Preparation of Soluble Heterodimers for 
Characterizing Binding Specificity to Fluorescein 

40 In order to further characterize the specificity of the 
mutagenized heterodimers expressed on the surface of phage 
as described above, soluble heterodimers were prepared and 
analyzed in ELISA assays on fluorescein-coated plates, by 
competive ELISA with increasing concentrations of soluble 

45 fluorescein-BSA and also by fluorescence quenching assays. 
The latter assays were performed as described in "Fluores- 
cein Hapten: An Immunological Probe", ed E. W. Voss. CRC 
Press. Inc. pp 52-54. 984. 

To prepare soluble heterodimers. phagemid DNA from 

50 positive clones was isolated and digested with Spel and 
NheL Digestion with these enzymes produces compatible 
cohesive ends. The 4.7-fcb DNA fragment lacking the glU 
portion was gel-purified (0.6% agarose) and self-ligated. 
Transformation of E. coli XLI-Blue afforded the isolation of 

55 recombinants lacking the gin fragment Clones were exam- 
ined for removal of the gUJ fragment by Xhol/Xbal 
digestion, which should yield an 1.6-kb fragment Clones 
were grown in 100 ml SB containing 50 ug/ml carbenicillin 
and 20 mM Mg0 2 at 37° C. until an OD^ of 0.2 was 

60 achieved. HTG (1 mM) was added and the culture grown 
overnight at 30° C. (growth at 37° C. provides only a light 
reduction in heterodimer yield). Cells were pelleted by 
centrifugation at 4000 rpm for 15 minutes in a J A 10 rotor at 
4° C. Cells were re suspended in 4 ml PBS containing 34 

65 ug/ml phenylmethylsulfonyl fluoride (PMSF) and lysed by 
sonication on ice (2-4 minutes at 50% duty). Debris was 
pelleted by centrifugation at 14.000 rpm in a JA20 rotor at 
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4° C. for 15 minutes. The supernatant was used directly for Insofar as immunoglobulin genes have three CDR regions 

ELIS A analysis and was stored at -20° C. For the study of on both the heavy chain and the light chain of an 

a large number of clones. 10-ml cultures provided plenty of immunoglobulin, each separated by a distinctive framework 

heterodimer for analysis. In this case, sonications were region, it is to be understood that the above example is 

performed in 2 ml of buffer. 5 readily applicable to introducing mutations into a specific 

The soluble heterodimers prepared above were assayed by CDR by selection of the above 5' and 3' nucleotide 

ft ISA For this assay 1 ug/well of fluorescein-BSA solu- sequences as to hybridize to the framework regions flanking 

lion was admixed to individual wells of a microliter plate the targeted CDR. Thus the above first and second frame- 

and maintained at 4° C. overnight to allow the protein work sequences can be the conserved sequences flanking 

solution to adhere to the walls of the well. After the 10 CDR1 CDR2 or CDR3 ^on either ^thc . heavy or Ught chain, 

maintenance period, the wells were washed one time with Exemplary and preferred is the CDR3 of the human immu- 

PBS and thereafter maintained with a solution of3%BSA to noglobuhn heavy chain. 

block nonspecific sites on the wells. The plates were main- The length of the 3 and 5' ^^^^^^ 

tained at 37° C for one hour after which time the plates were of a subject mutagenizing ohgonucleotide can vary in length 

uuiicudu/ uuciivia c rt i liK i* as is well known, so long as the length provides a stretch of 

inverted and shaken to remove the "A ^n. Mubte . u nucleotides complementary to the target framework 

heterodimers prepared above were then admixed tc .each ^ as ^ £ e ^ ^ ^ case rf mc 3, 

weU and maintained at 37° C. for one hour to form an tenninal nuclcot ide sequence, it must be of sufficient length 

immunoreaction products. Following the maintenance ^ wmplcmentaritY to ^ t^g* framework region located 

period, the wells were washed 10 times with PBS to remove y (Q ^ C p R region to ^ nuitagenized as to hybridize and 

unbound soluble antibody and then maintained with a sec- jq p rov ^ e a y hydroxyl terminus for initiating a primer exten- 

ondary goat anti-human FAB conjugated to alkaline phos- sion ^^00. In the case of the 5* tenninal nucleotide 

phatase diluted in PBS containing 1% BSA. The wells were sequence, it must be of sufficient length and complementa- 

maintained at 37° C. for one hour after which the wells were ^ty to the target framework region located 5' to the CDR 

was 10 times with PBS followed by development with region to be mutagenized as to provide a means for hybrid- 

p-nitrophenyl phosphate. 25 izm g m a **CR overlap extension reaction as described , 

Immunoreactive heterodimers as determined in the above above to assemble the complete immunoglobulin heavy or 

Fl ts a were then analyzed by competition ELBA to deter- light chain. 

roinc the affinity of the mutagenized heterodimers. The Framework regions flanking a CDR are well characterized 

EUSA was performed as described above with increasing in the immunological arts, and include known nucleotide 

concentrations of soluble fluorescein-BSA ranging in con- 30 sequences or consensus sequences as described elsewhere 

centration from lO'^M up to lO" 5 *! in concentration herein. Where a single, preselected i^unoglobulin gene is 

admixed in the presence of the soluble heterodimers. Maxi- to be mutagenized the flank- 

mal inhibition of binding was achieved at a concentration of ">? a particular CDR are known, or be reacWy deter- 

FCfree antigen with a half-maximal inhibition obtained ™* ^ nudeoUc* sequencing protocols. Where a rqxx- 

iu m ncc an^u jui . ™ . toire of immunoglobulin genes are to be mutagenized. the 

with approximately 10 M free antigen. Thu the 35 ^^^^ $ ^ ^ c preferably conserved, as 

mutagenized heterodimers of this invention specifically rec- « -7^ 7ll„w A-Lin 

ogoUe and bind to fluorescein. Additional experiments were descnted elsewhere herein. 

^med to confirm that the mutagenized hrfercdimers no Preferably, the length of the 3' and 5 term^ nucleotide 

longer recognized the TT to which they nonmutagenized sequences are each at least 6 nucleotides in length, and can 

heterodimer originally bound. Fluorescence quenching 40 be up to 50 or more nucleotides in length, although these 

assays were also performed to confirm the specificity of lengths are unnecessary to assure accurate and ^oduable 

binding of the mutagenized heter<xlimers. Soluble net- hybridizabon ™ ^l^Zi 

erodimers prepared from phage that were either eluted with nucleoUdes. and typically are about 18 nucleoUdes. 

acid or with fluorescein alone were equally effective at A particularly preferred framework-defined nucleotide 

binding fluorescein by any of the aforementioned 45 sequencefor use as a yt <^j™J^ 

approaches. The invention of mutagenesis of the CDR3 nucleotide sequence 5 -TGGOGCCAAGGGACCACG-3 

region of the heavy chain of a heterodimer described herein (SEQ ID NO 122). 

thus resulted in the alteration of binding specificity from TT A particularly preferred framework-defined nucleotide 

to fluorescein. sequence for use as a 5* terminus nucleotide sequence has the 

Thus. me above exarnplemustrates am 50 nucleotide sequence 5' - GTGTATTATTGTGCG AG A-3 ' 

the present invention for mutagenizing the complementarity (SEQ ID NO 123). 

determining region (CDR) of an imnwnoglobulin gene, and The nucleotide sequence located between the 3' and 5' 

also illustrates oligonucleotides useful therefor. termini adapted for mutagenizing a CDR can be any nucle- 

In one emfc>diment. therefore, an oligonucleotide is con- otide sequence, insofar as the novel sequence will be incor- 

templated that is useful as a primer in a polymerase chain 55 porated by the above methods. However, the present 

reaction (PCR) for inducing mutagenesis in a complemcn- approach provides a means to produce a large population of 

tarity determining region (CDR) of an mmmnoglc^xilin mutagenized CDR's in a single PCR reaction by the use of 

gene. The oligonucleotide has 3' and 5* termini and com- a population of redundant sequences defining randomized or 
prises (1) a nucleotide sequence at its 3' terminus capable of nearly randomized nucleotides m the CDR region to be 

hybridizing to a first framework region of an iramunoglo- 60 mutagenized. 

butin gene, (2) a nucleotide sequence at its 5' terminus A preferred ohgonucleotide comprises a nucleotide 

capable of hybridizing to a second framework region of an sequence between the above described T and 5' termini that 

immunoglobulin gene, and (3) a nucleotide sequence is represented by the formula: |NNS]„ or |NNK]„, wherein 

between the 3' and 5' termini adapted for introducing muta- N can independentiy be any nucleotide, where S is G or C 
tions during a PCR into the CDR region between the first 65 K is G or T, and where n is from 3 to about 24. In preferred 
and second framework regions of the immunoglobulin gene, enibodiments the preferred oligonucleotides have the for- 

thereby mutagenizing the CDR region. inula; 
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5'-<ntyTAiT(^rTGTGCGAOAfNNS kTGGGGCCAAGGGAC- i s subjected to PCR conditions that include a PCR primer 
CACG-r cseq id NO: 124) and J-CIUT^ATTC^- oligonucleotide as described above constituting the first 

G AGAfNNK LTGGGC5CCAAGGGACC ACG-3 (SEQ ID NO: , s . . D . 

125) * pnroer id a PCR primer pair as is well known to produce an 

amplified PCR product that is derived from the preselected 

Exemplary and particularly preferred is the oligonucleotide 5 CDR region but that includes the nucleotide sequences of 

where with the formula [NNS] ft and n is 16, such that the the PCR primer. The second oligonucleotide in the PCR 

oligonucleotide represents a large population of redundant amplifying conditions can be any PCR primer derived from 

oligonucleotide sequences. the immunoglobulin gene to be mutagenized, as described 

The invention also contemplates a mutagenesis method herein 

foral^gthei^ologi^spedficity of a donedimm- 10 f ^ , eotide of ftis 

nologlobulin gene. The method provides direct mutagenesis . . ^ J , D 

, _ - . ■ * i * i_ - t_ lnventioo as described above, 

in a preselected CDR of an immunoglobulin gene which 

comprises subjecting a recombinant DNA molecule (rDNA) The foregoing is intended as Ulustrauve of the present 

containing the cloned immunoglobulin gene having a target invention but not limiting. Numerous variations and modi- 
CDR to PCR conditions suitable for amplifying a prese- 15 fications can be effected without departing from the true 

Iected region of the CDR. In the method, the rDNA molecule spirit and scope of the invention. 

SEQUENCE LISTINO 

( 1 ) GENERAL INFORMATION: 

( i i i ) NUMBER OP SEQUENCES: 125 

( 2 ) tNFORMATION FOR SEQ ID NO:l: 

( V ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 173 ba»e pon 

< B ) TYPE: ncckic add 

( C ) STRANDEDNESS: doubic 
{ D ) TOPOLOGY: Boot 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOtl: 
GGCCOCAAAT TCTATTTCAA OOAOACAOTC ATAATOAAAT ACCTATTOCC TACOOCAOCC 60 
OCTOOATTOT TATTACTCGC TOCCCAACCA OCCATOOCCC AGOTGAAACT OCTCOAGATT 120 
TCTAOACTAO TTACCCGTAC OACOTTCCGO ACTACGOTTC TTAATAOAAT TCO 173 

( 2 ) INFORMATION FOR SEQ ID NCK2: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 173 feeae pc» 
( B ) TYPE: ouckac tad. 

< C ) STRANDEDNESS: double 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i 1 i ) HYPOTHETICAL: NO 

( i ▼ ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NCn2: 
TCGACGAATT CT AT T A AG A A CCOTAOTCCO OAACGTCOTA COOOTAACTA OTCTAOAAAT 60 
CTCOAOCAGT TTCACCTOOO CCATOOCTOO TTGOGCAOCO AOTAATAACA ATCCAOCOGC 120 
TGCCGTAGGC A A T AGO T AT T TCATTATOAC TGTCTCCTTO AAATAOAATT TOC H3 

( 2 ) INFORMATION FOR SEQ ID NOO: 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 131 tescjwr* 
( B ) TYPE: oodeic tad 
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-continued 



( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: fines 

( i i > MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i » ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-3: 
TOAATTCTAA ACTAOTCGCC AAOGAOACAG TCATAATOAA ATACCTATTG CCTACGOCAG 60 
CCGCTGGATT OTTAT TACTC GCTOCCCAAC CAGCCATGGC COAOCTCOTC AOTTCTAGAG 120 
TT AAGCOGCC O 1 3 1 

( 2 ) INFORMATION FOR SEQ ID NO:* 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 139 base purs 
( B ) TYPE: webac arid 
( C ) STRANDEDNESS: doable 
( D )TOPOLOOY: txoex 

( j i ) MOLECULE TYPE: DNA (genomic) 

(lit ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO* 
TCOACGGCCO CTTAACTCTA OAACTOACGA GCTCGOCCAT OGCTGOTTOO OCAOCGAOTA SO 
ATAACAATCC AGCGGCTGCC GTAOOCAATA OGT ATTTCAT TATGACTOTC TCCTTOOCOA 120 
CTAOTTTAGA ATTCAAOCT 139 

( 2 ) INFORMATION FOR SEQ ID N03: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 21 amino acids 
( B )TYPE: amino acid 
( D )TOPOLOOY: uaknown 

. ( i i > MOLECULE TYPE: peptide 

( i i i > HYPOTHETICAL: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO J: 

Met Lyi Tyr Leu Leu Pro Thr Alt Ala Ala Oly Leu Leu Leu Leu Ali 

Ata Ola Pro Ala Mel 

2 0 

( 2 ) INFORMATION FOR SEQ ID NO*: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 23 aasao acids 
( B ) TYPE: amiao acid 
( D ) TOPOLOGY: unknown 

( i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: Erwinia canawra 

( x i > SEQUENCE DESCRIPTION: SEQ ID NOo: 
Met Lyi Tyr Leu Leo Pro Thr Ala Ala Ala Oly Leu Leu Leu Leu Ala 
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-continued 



Ala Gin Pro Ala Gl.a Pro-Ala Met Ala 
2 0 2 5 



( 2 ) INFORMATION FOR SEQ ID NO:7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 amino acids 
( B ) TYPE: annuo acid 
( D ) TOPOLOOY: unknown 

( i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

( ▼ i ) ORIGINAL SOURCE: 

( A > ORGANISM: Erwmia carotoYCra 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Met Ly i Ser Leo 11c Thr Pro lie Ala Ala O I y Leo Leo Leo Ala Pbe 
, 3 10 15 

Sec Olo Tjrr Ser Leo Ala 
2 0 



( 2 ) INFORMATION FOR SBQ ID N08: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 28 amino arid* 
< B ) TYPE: amino acid 
{ D > TOPOLOOY: unknown 

(it ) MOLECULE TYPE: peptide - : 

(tit ) HYPOTHETICAL: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Met Ly* Me Lyi Thr Gly AU At| Me Le.o Ala Leu Ser Ala Leo Thr 

1 5 10 13. 

Thr Met Met Pbe Ser Ala Ser Ala Leu Ala Lp lie 

2 0 2 5 



( 2 ) INFORMATION FOR SEQ ID NOA 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 2* amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOOY: unknown 

( i i ) MOLECULE TYPE: peptide 

( t i i ) HYPOTHETICAL: NO 

( % 1 ) SEQUENCE- DESCRIPTION: SBQ ID NO* 

Met Met Ly* Arg Asa lie Loo Ala Val Me Val Pro Ala Leo Leo V*l 
1 5 10 15 . 

All Oly Thr All An Alt .Air Oil 
2 0 



( 2 ) INFORMATION FOR SEQ ID NO: 10: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 23 amino acids 
{ B ) TYPE: amino arid 
( D )TCfCajOGY:aiifa>Qwn 

( i i ) MOLECULE TYPE: peptide 

< i i i ) HYPOTHETICAL: NO 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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Met L y s G I ii Set Tbr I It Ala Lou Ala Leu Leu Pro Lew Leu Phe'Tbr 
I 5 10 15 

Pio V>l Thi Lys Ala Arj Thr 

2 0 

{2 ) INFORMATION FOR SEQ ID NO: 1 1: 

( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 25 amino acidi 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: unknown 

( i i ) MOLECULE TYPE: peptide 

( i » i ) HYPOTHETICAL: NO 

(at) SEQUENCE DESCRIPTION: SEQ H> NO-.ll: 

Me t s«r lie Gin His Pie Arg V.l Ala Leo lie Pro Pbe Phe Ala Ala 
1 5 10 15 

Phe Cys Leo Pro Vat Phe Ala Hii Pro 

■2 0 2 5 

( 2 ) INFORMATION FOR SEQ ID NO: 12: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 amino acid* 
( B ) TYPE: tmino acid 
( D ) TOPOLOOY: onknocni . 

( i i ) MOLECULE TYPE: peptide 

{ i i i ) HYPOTHETICAL: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Met lie Tbr Leu Arg Lyi Leu Pro Leo Ala Val Ala Val Ala Ala 
I 5 10 15 

Oly Val Met Ser Ala Oil A 1 n Me t Ala Val A * p 
2 0 2 5 

( 2 ) INFORMATION POR SEQ ID NO: 13: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 ehbdo coda 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: unknown 

( i t ) MOLECULE TYPE: peptide 

( i t i ) HYPOTHETICAL: NO 

( * i ) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Ly» Ala Thr Ly» Leu Vol Leu Oly Ala Val lie Leo Gly Ser Tbr 
1 3 10 15 

Leu Leu Ala Oly Cy* Ser 
2 0 

( 2 ) INFORMATION FOR SEQ ID 1*0:14: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 23 tmino tckb 
( B ) TYPE: anmo acid 
< D ) TOPOLOGY: t 

( i i ) MOLECULE TYPE: pepode 

( t i i ) HYPOTHETICAL: NO 



(ii) SEQUENCE DESCRIPTION: SEQ ID NO:H: 
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Mel Ly I Ly t Ser Leu Val Lea Lyi Aim Ser Vat Ala Vat Ala Tbi L e n 
1 5 . 10 15 

Val Pro Met Leu Ser Phe Ala 
2 0 

(' 2 ) INFORMATION FOR 5BQ ID NO. 15: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 18 amino ackb 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: unknown 

{ i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Lyi Lyi Leo Lea Phe Ala lie Pro Leu Val Val Pro* Pbe Tyr Ser 
] 5 10 13 

H i *» S e r 

( 2 ) INFORMATION FOR SEQ ID NO: 16: 

( i ) SEQUENCE CHARACTERISTICS: 

{ A ) LENGTH: 211 annuo adds 

( B ) TYPE: amino acid 

< D ) TOPOLOOY: onknowa 

( i i ) MOLECULE TYPE: protean 

( i i i ) HYPOTHETICAL: NO 

( t ) FRAGMENT TYPE; internal 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.16: 

Pro Pbe Val Cy* Q 1 o Tyr O 1 ft Ol y Gin G 1 y O 1 o Ser Sci Aip Lea Pro 
1 5 10 15 

Ola Pro Pro Val Aid Ala Ol y Oly Gly Ser Oly Gly Gly Ser Ol y Gly 
2 0 2 5 30 

Gty Ser Ota Gly Gly Oly Ser Oln Gly Oly Gly Ser Ola Oly Oly Oly 
3 5 4 0 45 

Ser Olu Oly Oly Oly Ser Oly Oly Gly Ser Gly Ser Oly A * p Pbe Aip 
5 0 5 5 6 0 

Tyr Ota L y » Met Ala Asn Ala Asa Lyi Oly Ala Mel Tbr Olu Asa Ala 
65 70 7 5 80 

A * p Ola Asa Ala Leu Ola Sir Alp Ala Lyi Oly Lyi Lea Asp Ser Val 

8 5 '90 95 

Ala Tbr Aip Tyr Oly Ala Ala lie Aip. Oly Phe lie Oly Aip Val Ser 
100 105 110 

Oly Leo Ala A*» Oly Ain Gly Ala Tbr Gly Aip Pbe Ala Oly Ser A«n 
115 12 0 12 5 

Ser Gin Met Ala Gin Val Gly Aip Gly Aip Ain Ser Pro Lea Met Ain 
13 0 13 5 14 0 

Asa Pbe Arg Gin Tyr Lea Pro Ser Lea Pro Ola Ser Vat Ola Cys Arg 
145 150. 155 160 

Pro Pbe Val ?h« Sot Ala Gly Ly* Pro Tyr Ol o Pbo Ser lie Aip Cyi 
1 6 5 1 7 0 1 7 5 

A » p Lyi lie Ain Lea Pbe Arg Oly V*] Pbe Ala Phe Lea Lcn Tyr Val 
18 0 18 3 19 0 

Ala Tbr Pbe Met Tyr Val Pbe Ser Tbr Pbe Ala Ain lie Leo Arg Ais 
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L y s Ola Sei 
1 1 0 



( 2 > INFORMATION FOR SEQ ID NO: 17: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 50 arioo scids 
( B ) TYPE: axono acid 
( D ) TOPOLOGY: unknown 

( i i ) MOLECULE TYPE: protein 

( i i i ) HYPOTHETICAL: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Ala Olu G 1 y A » p Asp Pro Ala Lji Ala Ala Pbe Aio Ser Loa Ola Ala 
1. 5 10 1.5 

Ser Ala Thi Olu Tyr 11c Oly Tyi Ala Trp Ala Met Val Val Val lie 

20 . ' 2 5. 30 

Val Oly Ala Tar lie Oly lie Lyi Leo Pbe Ly* Lyi Pbe Tbr Ser Lyi 
35 40 43 

Ala Ser 

5 0 



( 2 ) INFORMATION FOR SEQ ID NO 18: 

{ i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pain 
( B >TYPE: Buckk add 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: Hnetr 

( i i ) MOLECULE TYPE: RNA (acaomoc) 

( i i k ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOUS: 
AAUCUUGGAO OCUUUUUUAU OOUUCOUUCU 



( 2 ) INFORMATION FOR SEQ ID NO: 19: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pain 
( B ) TYPE: cocleic add 
( C ) STRANDEDNESS: docMe 
( D )TOPOLOGY: Bacar 

( i i ) MOLECULE TYPE: RNA (aencmic) 

< i t i > HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:19: 
UAACUAAOOA UOAAAUGCAU GUCUAAOACA 



( 2 ) INFORMATION FOR SEQ ID NO-J20: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 30 base pen " 
< B ) TYPE: nnclric effld 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: RNA (aeoenric) 



< i i i ) HYPOTHETICAL: NO 
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( i ▼ ) ANTI- SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO JO: 
UCCUAGGACO UUUOACCUAU GCOAOCUUUU 

( 2 ) INFORMATION FOR SEQ rD N021: 

{ i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 buc pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS. doable 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: RNA (genome) 

(iii ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE; NO 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
AUOUACUAAO GAGGUUOUAU GGAACAACOC 

( 2 > INFORMATION FOR SEQ ID NO:22: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pun 
( B ) TYPE: nockic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: finear 

( i i ) MOLECULE TYPE: DNA (genome) 

(iii ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID N022: 
GGCCGCAAAT TCTATTTCAA OOAOACAQTC AT 



( 2 ) INFORMATION FOR SEQ ID NO-J3: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 36 base pain 
( B ) TYPE: aoclek add 
{ C ) STRANDEDNESS: single 
( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (gcoctmc) 

(iii ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOt23: 
AATOAAATAC CTATTGCCTA COGCAOCCOC TGOATT 



( 2 ) INFORMATION FOR SEQ ID NO:24: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pain 
( B ) TYPE: mefcic acid 
( C ) STRANDEDNESS: oajdc 
( D ) TOPOLOGY: Bacar 

( i t ) MOLECULE TYPE: DNA (genomic) 

(lit ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 



< t i ) SEQUENCE DESCRIPTION: SEQ ID N034: 



83 



5.759.817 



-continued 



84 



OTTATTACTC gctgcccaac cagccatogc cc 

( 2 ) INFORMATION FOR SEQ ID NOS5: 

( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 29 base pain 
< B )TYPE: nucJcic acid 
( C ) STRANDEDNESS: imgte 
1 D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N02S: 
CAOTTTCACC TOOGCCATGO CTOCTTGOG 

( 2 > INFORMATION FOR SEQ ID NO J6: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 40 ba»e pzurs 
( S ) TYPE: mckic acid 
; ( C ) STRANDEDNESS: tingle 
( D ) TOPOLOOY: linear 

( j i ) MOLECULE TYPE: DNA (genomic) 

< i i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CAOCOAOTAA TAACAATCCA CCOGCTOCCO TAOGCAATAO 

( 2 ) INFORMATION FOR SBQ ID NO:27; 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 38 base pain 
( B ) TYPE: nocloc add 
( C ) STRANDEDNESS: uagk 
( D ) TOPOLOGY: Etnear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO . 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NC«7: 
OTATTTCATT ATGACTOTCT CCTTOAAATA OAATTTGC 



( 2 ) INFORMATION FOR SEQ ID NOO»: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 40 base pain , 
{ B )TYPE: oucJoc acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOOY: Knear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

{ i v ) ANTI -SENSE: NO . 

(i i ) SEQUENCE DESCRIPTION: SEQ ID N02S: 
AGOTOAAACT OCTCOAOATT TCTAOACTAO TTACCCOTAC 



( 2 ) INFORMATION FOR SEQ ID N029- 
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f i' ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 38 base pain 
( B ) TYPE: nucleic «ad 
( C ) STRANDEDNESS: Magic 
t D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i > HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.29: 
CGOAACGTCC TACOOOTAAC TAOTCTAOAA ATCTCOAO 

( 2 ) INFORMATION FOR SEQ ID NO-JO: 

( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 base pain 
{ B ) TYPE: naclbc acid 
< C ) STRANDEDNESS: single 
( D ) TOPOLCO Y: linear 

< i i ) MOLECULE TYPE: DNA (genomic) 
( i i i ) HYPOTHETICAL: NO 

( iv) ANTI-SENSE: NO 

< i i ) SEQUENCE DESCRIPTION: SEQ ID NO-JO: 
OACOTTCCOO ACTACGOTTC TTAATAGAAT TCO 

( 2 ) INFORMATION FOR SBQ ID NOO I : 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 28 txac pain 
( B ) TYPE: aockk acid 
( C ) STRANDEDNESS: single 
{ D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO 31: 
TCOACOAATT CTATTAAGAA CCOTAOTC 

( 2 ) INFORMATION FOR SEQ ID NOJ2: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 34 base pain 
. ( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i * ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO--32: 
TOAATT CT A A ACTAGTCGCC AAOOAOACAO TCAT 



( 2 ) INFORMATION FOR SBQ ID N003: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 36 base pain 
{ B )TYPE: Bodeic sod 
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( C ) STRANDEDNESS: iiaglc 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-33: 
AATOAAATAC CTATTGCCTA COOCAGCCOC TGOATT 

( 2 ) INFORMATION FOR SEQ ID NO-J4: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENOTH: 31 base pain 
( B ) TYPE: nocksc acid 
( C ) STRANDBDNESS: ungle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (gcaxnk) " 

(lit ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ © NO-J4: 
GTTATTACTC OCTOCCCAAC CAOCCATGGC C 

( 2 ) INFORMATION FOR SEQ ID NOS5: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pair* 
( B >TYPE: aocksc acid 
( C ) STRANDEDNESS: italic 
( D )TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i > HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

(ti) SEQUENCE DESCRIPTION: SEQ ID N035: 
OAGCTCOTCA OTTCTAOAOT TAAOCOOCCO 

( 2 ) INFORMATION FOR SEQ ID NO-J6: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENOTH: 48 ba*o pairs 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

(i*) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N036: 
OTATTTCATT ATOACTGTCT CCTTOOCOAC TAOTTTAOAA TTCAAOCT 

( 2 ) INFORMATION FOR SEQ ID NOJ7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENOTH: 40 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: «n«V» 
( D ) TOPOLOOY: linear 



( i » ) MOLECULE TYPE: DNA (genomic) 
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{ i i i ) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOJ7: 
CAOCOAGT AA TAACAATCCA OCGOCTOCCO TAOGCAATAG 



( 2 ) INFORMATION FOR SEQ ID NOOS: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 base paii* 
( B >TYPE: cuckic acid 
( C ) STRANDEDNESS: imgle 
( D ) TOPOLOGY: boss 

( i i ) MOLECULE TYPE: DNA [genomic) 

( i t i ) HYPOTHETICAL: NO 

(ir) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N038: 
TGACOAOCTC GOCCATOOCT GQTTGOO v 

( 2 ) INFORMATION FOR SEQ ID NO-J9: 

( i ) SEQUENCE CHARACTERISTICS: 

< A ) LENGTH: 24 base pain 

< B ) TYPE: tmckk add 

( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: lbur 

( i i ) MOLECULE TYPE; DNA (genome) 

( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x 1 ) SEQUENCE DESCRIPTION: SBQ ID N009: 
TCOACGOCCG CTTAACTCTA OAAC 



( 2 ) INFORMATION FOR SEQ ID NO^Ch 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 haae pan* 
( B ) TYPE: nucleic ECtd 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: liaea 

( i i ) MOLECULE TYPE: DNA genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ 0) NO:<0: 
AOOTSMARCT KCTCOAOTCW OG 



( 2 ) INFORMATION FOR SEQ rD NCh41 : 

( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pars 
( B ) TYPE: Bodeec rod 
< C ) STRANDEDNESS: smgk 
{ D ) TOPOLOGY: Ewao 

( i i ) MOLECULE TYPE: DNA {genomic) 

( i i i ) HYPOTHETICAL: NO 



( i ▼ ) ANTI-SENSE: NO 
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(xi ) SEQUENCE DESCRIPTION: SEQ ED NO:41 : 
AOGTCCAOCT OCTCOAOTCT GO 



( 2 > INFORMATION FOR SEQ ID NO:42: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENOTH: 22 base pain 
( 8 ) TYPE: nucleic acid 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: Hues 

( i i ) MOLECULE TYPE: DNA (genomic) 



< i t i ) HYPOTHETICAL: NO 
( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOvC: 
AOOTCCAOCT GCTCGAOTCA GO 



( 2 ) INFORMATION FOR SEQ ID NO:43: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENOTH: 22 bmc pan 
( B )TYPE: nucleic acid 
{ C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i j > MOLECULE TYPE: DNA (genomic) 



( i i i ) HYPOTHETICAL: NO 
( i t ) ANTI-SENSE: NO 



(ii ) SEQUENCE DESCRIPTION: SEQ ED NO: 43: 
AOOTCCAOCT TCTCOAOTCT OG 



( 2 ) INFORMATION FOR SEQ ID NO*«: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pass 
( B ) TYPE: nockac acid 
( C ) STRANDEDNESS: single 
f D ) TOPOLOGY: linear 



( i i ) MOLECULE TYPE: DNA (genomic) 
( i i i ) HYPOTHETICAL: NO 



(it) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO-.44: 
AOOTCCAOCT TCTCOAOTCA OG 



( 2 ) INFORMATION FOR SEQ ID NO:45: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 bm pain 
( B ) TYPE: oneiric acid 
( C ) STRANDEDNESS: single 
( D )TOPOL00Y: tinea* 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 



(it) ANTI-SENSE: NO 

( x i } SEQUENCE DESCRIPTION: SEQ ID NO:*5: 
AOOTCCAACT OCTCOAOTCT OG 
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( 2 ) INFORMATION FOR SEQ ID N0.46: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 22 base peon 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: «ngk 
{ D ) TOPOLOGY: Hue* 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i y ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:<6: 



( 2 ) INFORMATION FOR SEQ ID NO:47: 

( j ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pair* 
( B ) TYPE: DDclbc $aA 
( C ) STRANDEDNESS: single 
( D ) TOPOLOO Y: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ED NO:47: 



( 2 ) INFORMATION FOR SEQ ID NO:48: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pan 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (genomic) 

( t i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

(it) SEQUENCE DESCRIPTION: SEQ ID NO:4S: 



< 2 ) rNPORMATION FOR SEQ ID NO:49: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: tinesr 

( i i ) MOLECULE TYPE: DNA (genomic) 

( a i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:49: 
AOOTNNANCT NCTCOAOTCW CO 22 



( 2 ) INFORMATION FOR SEQ ID NO-JO: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base pain 
( B ) TYPE: nuckic acid 
( C ) STRANDEDNESS: siagje 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i > HYPOTHETICAL: NO 

( i v > ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO^O: 
GCCCAAGOAT GTGCT CACC 



( 2 ) INFORMATION FOR SEQ ID NOJ1: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 39 base p«n 
( B > TYPE: nucleic acid 
( C ) STRANDEDNESS: lingle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i j ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

{ x i ) SEQUENCE DESCRIPTION: SBQIDN031: 
CTATTAGAAT TCAACOOTAA CAOTGGTGCC TTGGCCCCA 



( 2 ) INFORMATION FOR SEQ ID NO-.52: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 38 base pairs 
( B > TYPE: ouckic acid 
{ C > STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

(tit ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

SEQUENCE DESCRIPTION: SEQ ID NO-.52: 

CTATTAACTA GTAACOGTAA CAOTOGTOCC TTGCCCCA 



( 2 ) INFORMATION FOR SEQ ID N053: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base paira 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: ingle 
( D ) TOPOLOO Y: linear * 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i t i ) HYPOTHETICAL: NO 

( t y ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NCKS3: 
CTCAGTATGO TOGTTGTGC 



( 2 ) INFORMATION FOR SEQ ID NO-JW: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 26 base pairs 
( B ) TYPE: nuclek nod 
( C ) STRANDEDNESS; single 
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( D >TC*»OLOOY: linear 
( i i ) MOLECULE TYPE: DNA (genomic) 
( j i i ) HYPOTHETICAL: NO 
( i t ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO J4: 
OCT AC T AOT T TTOAT TTC C A CCTTGO 

( 2 ) INFORMATION FOR SEQ ID N033: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 23 b**e pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: tingk 
( D ) TOPOLOOY: finer 

( i i ) MOLECULE TYPE: DNA (genomic) 

< i i i ) HYPOTHETICAL: NO 

< j t ) ANTI-SENSE: NO 

< x i ) SEQUENCE DESCRIPTION: SEQ ID NO.55: 
CAOCCATOCC CGACATCCAO A TO 

( 2 ) INFORMATION FOR SEQ ED NOt56: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pin 
( B ) TYPE: nucleic tad 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

{ i t i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOJ6: 
AATTTTACTA OTCACCTTOO TOCTOCTOOC 

( 2 ) INFORMATION FOR SEQ ID NOJ7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 39 base parin 
(^B ) TYPE: findeic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOOY: linear 

( t i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOJ7: 
TATOCAACTA OTACAACCAC AATCCCTOOG CACAATTTT 



< 2 ) INFORMATION FOR SEQ ID N038: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 bout pain 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 



( i i ) MOLECULE TYPE: DNA (genomic) 
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( i i i ) HYPOTHETICAL: NO 
( i v ) ANTI-SENSE: NO 

f % i ) SEQUENCE DESCRIPTION: SEQ ED NO.58: 
AOOCTTACTA OTACAATCCC TOOOCACAAT 

( 2 } INFORMATION FOR SEQ ID N039: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pairs 
( B ) TYPE: nucleic acid 
I C ) STRANDEDNESS: skglc 
( D ) TOPOLOGY: linear 

( j i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

{ x i ) SEQUENCE DESCRIPTION: SEQ ID NOS9: 
CCAOTTCCQA GCTCOTTOTO ACTCAGGAAT CT 

( 2 ) INFORMATION FOR SEQ ID NO:60: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 32 bate pair* 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: Jiftgle 
( D )TOPOLOOY: haear 

( i t > MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NCM50: 
CCAOTTCCOA OCTCOTGTTO ACOCAGCCOC CC 

( 2 ) INFORMATION FOR SEQ ID N061: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 32 bate pain 
( B ) TYPE: Micleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SBQ ID N061: 
CCAOTTCCOA OCTCOTOCTC ACCCAOTCTC CA 

( 2 ) INFORMATION FOR SEQ ID NOrf2: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base jmn 
( B ) TYPE: nucleic add 
( C ) STTRANDEDNESS: lingle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i l i ) HYPOTHETICAL: NO 



< i » ■) ANTI -SENSE: NO 
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( x i > SEQUENCE DESCRIPTION: SEQ ID NOrtt: 
CCAOTTCCOA OCTCCAGATO ACCCAOTCTC CA 

( 2 ) INFORMATION FOR SEQ ID NCh63: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
{ D ) TOPOLOGY: fine* 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( iv ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NCh63: 
CCAOATGTGA GCTCOTOATG ACCCAOACTC CA 

( 2 ) INFORMATION FOR SEQ ID NO:64: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pain 
( B ) TYPE: oDckic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Spear 

f i i ) MOCECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

(li ) SEQUENCE DESCRIPTION: SEQ ID NO**: 
CCAGATGTOA OCTCGTCATG ACCCAOTCTC CA 

( 2 ) INFORMATION FOR SEQ ID NOr65: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pain 
( B ) TYPE: onckic acid 
( C ) STRANDEDNESS: single 
< D )TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (gcoccbc) 

( t i i ) HYPOTHETICAL: NO 

{ iv) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO£5: 
CCAOATOTOA OCTCTTQATO ACCCAAACTC AA 

( 2 ) INFORMATION FOR SEQ ID NOrf6: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pan 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOOY: liaear 

( i i ) MOLECULE TYPE: DNA (gcaotmc) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:£6: 
CCAGATGTOA OCTCOTOATA ACCCAOOATO AA 
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( 2 ) INFORMATION FOR SEQ ID NO*7: 

f i ) SEQUENCE CHARACTERISTICS: 
t ( A ) LENGTH: 32 base pain 

< B ) TYPE: nucleic acid 

< C > STRANDEDNESS: img&e 
( D ) TOPOLOGY: line* 

( i i ) MOLECULE TYPE: DNA (genomic) 

f i i i ) HYPOTHETICAL: NO 

( i * ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N0rf7: 
GCAOCATTCT AOAOTTTCAO CTCCAOCTTO CC 

( 2 ) INFORMATION FOR SEQ ID NCh68: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 33 base pairs 
( B ) TYPE: nocleic acid 
( C ) STRANDEDNESS: stogie 
( D ) TOPOLOGY: Hacar 

( j i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

(xt) SEQUENCE DESCRIPTION: SEQ ID NO*S: 
CCGCCOTCTA OAACACTCAT TCCTGTTOAA GCT 

( 2 ) INFORMATION FOR SEQ ID NO*9: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: tmgfe 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i t i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOrf9: 
CCOCCOTCTA OAACATTCTO CAOOAGACAO ACT 

< 2 ) INFORMATION FOR SDQ ID NO: 70: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 32 base pan * 

( B ) TYPE: nucleic add 

( C ) STRANDEDNESS: tingle 

( D ) TOPOLOGY: tbear 

( i i ) MOLECULE TYPE: DNA (gcaoanc) 

< i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NOV*): 
CCAOTTCCGA GCTCOTOATO ACACAOTCTC CA 

( 2 ) INFORMATION FOR SEQ ID NO:7J: 

( i ) SEQUENCE CHARACTERISTICS: 
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( A ) LENGTH: 34 base pain 
( B ) TYPE: mckic acid 
( C ) STRANDEDNESS: siagk 
< D )TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (gencroic) 

f i i i ) HYPOTHETICAL: NO 

{ i » ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NCr71: 
OCOCCOTCTA OAATT AACAC TCATTCCTGT TOAA 



( 2 ) INFORMATION FOR SEQ ID NO:72: 

( i ) SEQUENCE CHARACTERISTICS: 
f A) LENGTH: 38 base pain 
{ B ) TTPE: nockac arid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (geoomic) 

(iii) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ 03 NO:72: 
CTATTAACTA GTAACOOTAA CAGTOGTGCC TTGCCCCA 

( 2 ) INFORMATION FOR SEQ ID NO.73: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pain 
( B ) TYPE: mckic acid 
( C ) STRANDEDNESS: angle 
( D )TOPOLOCY: boear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:73: . 
AGOCTTACTA OTACAATCCC TOOOCACAAT 



( 2 ) INFORMATION FOR SEQ ID NO:?*: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 29 base psn 
( B ) TYPE: Docldc acid 
( C ) STRANDEDNESS: angle 
( D ) TOPOLOOY: lima 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

< i » ) ANTI-SENSE: NO 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO:74: 
OCCOCTCTAO AACACTCATT CCTGTTGAA 



( 2 ) INFORMATION FOR SEQ ID NOT75: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: Z2 base pain 
( B ) TYPE; mckic acid 
( C ) STRANDEDNESS: i mgfe 
( D ) TOPOLOOY: fine* 
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( i i ) MOLECULE TYPE DNA (gcaotnk) 
(ii'i ) HYPOTHETICAL: NO 
( i v ) ANTI-SENSE: NO 

( x » > SEQUENCE DESCRIPTION: SEQ ID NO:75: 
AOOTNNANCT NCTCOAOTCT GC 2 2 



( 2 ) INFORMATION FOR SEQ ID NO:76: 

( i ) SEQUENCE CHARACTERISTICS: 

< A ) LENGTH: 22 has* pain 
( B ) TYPE: nuckic aod 

< C ) STRANDEDNESS: single 
( D ) TOPOLOGY: fine*! 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

< x i ) SEQUENCE DESCRIPTION: SEQ ID NO:76: 
AOOTNNANCT NCTCOAOTCA GC 



( 2 ) INFORMATION FOR SEQ ID NO.77: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base pain 
( B ) TYPE: nucleic cod 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOOY: Hnesr 

( i i ) MOLECULE TYPE: DNA (genomic) 

(Mi ) HYPOTHETICAL: NO 

< i ▼ ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:77: 
OTOCCAOATO TOAOCTCOTO ATOACCCAOT CTCCA 



( 2 ) INFORMATION FOR SEQ ID NO:7B: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 34 base pain 
f ( B ) TYPE: en ckic rod 

( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: fineo 

( i i ) MOLECULE TYPE: DNA (genomic) 

(Mi ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.7S: 
TCCTTCTAOA TTACTAACAC TCTCCCCTOT T O A A 



( 2 ) INFORMATION FOR SBQ ID NO: 79: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 34 base pain 
( B ) TYPE: mckic acid 
( C ) STRANDEDNESS: tingle 
( D )TC*OLOaY:Enear 

( i i ) MOLECULE TYPE: DNA (genomic) 



( i i i ) HYPOTHETICAL: NO 
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f i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.79: 
OCATTCTAOA CTATTATOAA CATTCTOTAO GOOC 

( 2 ) INFORMATION FOR SEQ ID NChSO: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 37 base pan* 
( B ) TYPE: Dockic acid 
( C ) STRANDEDNESS: ringk 
( D ) TOPOLOGY: linear 

f i i ) MOLECULE TYPE: DNA (genomic) 

(iit ) HYPOTHETICAL: NO 

( i > ) ANTI-SENSE: NO 

( * i ) SEQUENCE DESCRIPTION: SEQ ID NO:90: 
CTGCACAOOG TCCTOGGCCO AOCTCOTOOT GACTCAO 

( 2 ) INFORMATION FOR SEQ ID NO*!: 

{ i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 Utc pain 
( B ) TYPE: nucleic *ckl 
( C ) STRANDEDNESS: imgle 
{ D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( t i i ) HYPOTHETICAL: NO 

( i * ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO*!: 
AGNTGCANNT OCTCGAOTCT GO 



( 2 ) INFORMATION FOR SEQ ID NO:82: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 43 ba*c pm 
( B ) TYPE: oocJeic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Knew 

( i i ) MOLECULE TjYPE: DNA (geaccmc) 

(iit ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:S2: 
OTOOOCATOT GTOAOTTOTO TCACTAOTTG GOGTTTTGAO CTC 



( 2 ) INFORMATION FOR SEQ ID NO:83: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 bate pass 
( B ) TYPE: oudeic acid 
( C ) STRANDEDNESS: angle 
( D ) TOPOLOGY: fioear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( j i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
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AOCATCACTA GTACAAOATT TOOOCTC 



( 2 ) INFORMATION FOR SEQ ID NOM: 

( i > SEQUENCE CHARACTERISTICS: 

< A ) LENGTH: 22 base pair* 

< B ) TYPE: suckic acid 

< C ) STRANDEDNESS: single 

< D ) TOPOLOGY: linear 

■ ( i i ) MOLECULE TYPE: DNA (gcoceoic) 
( i i i ) HYPOTHETICAL: NO 
( i » ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:**: 
AOOTOCAGCT OCTCOAOTCT OO 



( 2 ) INFORMATION FOR SEQ ID NO: 85: 

{ i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 baw pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: uagk 
( D ) TOPOLOGY: Knear 

( i i ) MOLECULE TYPE: DNA (genomic) 

< i i i ) HYPOTHETICAL: NO 

( i * ) ANTI-SENSE: NO 

( x i > SEQUENCE DESCRIPTION: SEQ ID NO:85: 
AOOTOCAGCT OCTCOAGTCG OO 



( 2 ) INFORMATION FOR SEQ ID NO: 86: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pan* 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: uogfe 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (geaoanc) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
AGOTOCAACT OCTCOAOTCT OO 



( 2 ) INFORMATION FOR SEQ ID NO: 87: 

( i ) SEQUENCE CHARACTERISTICS: , 
( A ) LENGTH: 22 base pain 
( B ) TYPE: ouckic acid 
( C ) STRANDEDNESS: single 
( D > TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (gcaosxac) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NOiS7: 



AOGTOCAACT OCTCOAGTCG OO 



5.759.817 

113 114 

-continued 



( 2 ) INFORMATION FOR SEQ. ID NO:88: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 34 bale pain 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

{ k t. ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOtSS: 
TCCTTCTAOA TTACTAACAC TCTCCCCTGT TGAA 



( 2 ) INFORMATION FOR SEQ ID NO: 89: 

( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 37 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
{ D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i ▼ ) ANTI-SENSE: NO 

( x j ) SEQUENCE DESCRIPTION: SEQ ID NO-J9: 
CTOCACAOOO TCCTOOOCCO AOCTCQTOOT OACTCAO 



( 2 > INFORMATION FOR SEQ ID NO^O: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 31 base pairs 
( B ) TYPE: oodeic acid 
( C ) STRANDEDNESS: ringfc 
( D ) TOPOLOGY: Soear 

( i i ) MOLECULE TYPE: DNA (genomic) 

(til ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO30: 
OCATTCTAOA CTATTAACAT TCTGTAOOOO C 



< 2 ) INFORMATION FOR SEQ ID NOrfM : 

( i ) SEQUENCE CHARACTERISTICS: 

< A ) LENGTH: 19 base pain ■ 

< B ) TYPE: nucleic acid 

( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: Bdcm 

( i i ) MOLECULE TYPE: DNA (genomic) 

{ i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID 

ACCCAAOOAC ACCCTCATG 



( 2 ) INFORMATION FOR SEQ ID NOS2: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 ba» peon 
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( B ) TYPE: nuckic acid 

( C ) STRANDEDNESS: single 

( D ) TOPOLOGY: finear 

( i i ) MOLECULE TYPE: DNA (gnomic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i > SEQUENCE DESCRIPTION: SEQ ID NO:92: 
CTCAOTATOO TOOTTOTOC 



( 2 ) INFORMATION FOR SEQ ID NOS3: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 base pain 
( B ) TYPE: nucleic tad 
( C ) STRANDS DNESS: tingle ' 
( D ) TOPOLOGY: linen 

f i i ) MOLECULE TYPE: DNA (genomic) 

( i i i > HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO."93: 
OTCTCACTAO TCTCCACCAA OOOCCCATCO OTC 



( 2 ) INFORMATION FOR SEQ ID N0^4: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 39 base pan 
( B ) TYPE: roddc edd 
( C ) STRANDEDNESS: angle 
( D > TOPOLOGY: finstr 

(it) MOLECULE TYPE: DNA (genomic) 

( t i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( % i ) SEQUENCE DESCRIPTION: SEQ ID N094: 
ATATACTAOT OAOACAGTOA CCAGOGTTCC TTOGCCCC 



{ 2 ) INFORMATION FOR SBQ ID NCh95: 

( i ) SEQUENCE CTIARACTERISTICS: 
( A ) LENOTH: 24 base pain 
( B ) TYPE: nucleic cekJ 
( C ) STRANDEDNESS: single 
(. D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

{ i v ) ANTI-SENSE: NO 

(ii) SEQUENCE DESCRIPTION: SEQ ID N095: 
ACOTCTAGAT TCCACCTTOO TCCC 



( 2 ) INFORMATION FOR SBQ ID NOS6: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENOTH: 31 base paan 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOOY: finecr 
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( i i ) MOLECULE TYPE: DNA (genomic) 
(lit ) HYPOTHETICAL: NO 
I i v ) ANTI-SENSE: NO 

f x i ) SEQUENCE DESCRIPTION: SEQ ID NOS6: 
OCATACTAOT CTATTAACAT TCTOTAOOGO C 

( 2 ) INFORMATION FOR SEQ ID N0^7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 base pairs 
( B ) TYPE: uockic acid 
( C ) STRANDEDNESS: Jingle 
( D > TOPOLOGY: line* 

( » i ) MOLECULE TYPE: DNA (genomic) 

( i i i > HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( i j ) SEQUENCE DESCRIPTION: SEQ ID N037: 
CCOOAATTCT TATCATTTAC CCOOAOA 

( 2 ) INFORMATION FOR SEQ ID NO*93: 

( i ) SEQUENCE CHARACTERISTICS: 
. ( A ) LENGTH: 29 base pairs 
( B ) TYPE: oBcleic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOOY: line* 

{ i i )NrOLECULE TYPE: DNA (generic) 

( i i i ) HYPOTHETICAL; NO 

( i v ) ANTVSENSE: NO 

(ii ) SEQUENCE DESCRIPTION: SEQ ID NO$8: 
TCTOCACTAO TTOOAATOGO CACATOCAG 

( 2 ) INFORMATION FOR SEQ ID NOS9: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 798 base pan 
( B ) TYPE: rockicacid . 
< C ) STRANDEDNESS: double 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N03* 



GOCCGCAA AT 


TCTATTTCAA 


GGAGACAGTC 


AT AATOAAAT 


ACCTATTOCC 


TACGOC AGCC 


6 0 


OCTOG ATTOT 


T ATT ACTCOC 


TGCCCAACC A 


OCC ATOOCCC 


AOOTG AA ACT 


GCTCOAOTCA 


I 2 0 


OOACCTGGCC 


TCOTO A AACC 


TTCTCAOT CT 


CTOTCTCTCA 


CCTGCTCTOT 


CACTG ACT AC 


I 8 0 


T CC ATCACC A 


OTGCT TATTA 


CTOGA ACTOO 


ATCCOOCAOT 


TTCCAOOAAA 


CAAACTOOAA 


2 4 0 


TGOATOOGCT 


ACATAAOCTA 


CO ACGGTGTC 


AAT AAOTATO 


ATCCATCTCT 


CA AO A A TCGA 


3 0 0 


ATCT CCATC A 


CTCOTG AC AC 


ATCT AACAAT 


CAGTTTTTCC 


AOAAOTTOAT 


TTCTGTOACT 


3 6 0 


TCTOAOGACA 


C AOOA AC AT A 


TOACTGTTCA 


AGAGOGACTA 


OOOCCTC TOC 


TATOOACTAC 


4 2 0 


TOOOOTCAAO 


GAATTTCAOT 


CACCOTCTCC 


T CAOCC AAA A 


CGAC ACCCCC 


ATCTGTCT AT 


4 8 0 
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CC ACTOOCCC 


CTGGATCTGC 


TOCCC AA ACT 


AACTCCATOG 


TGACCCTGOO 


A T G CCTGGTC 


5 4 0 


AAOGGCT ATT 


TCCC TGAOCC 


AO TGACAGTG 


ACCTGG AACT 


CTGGATCCCT 


O TCCAOCOOT 


6 0 0 


OTOCACACCT 


TCCC AGC TOT 


CCTGC AGTCT 


OACCTCT ACA 


CTCTOAOCAG 


CT CAGTG ACT 


6 6 0 


GTCCCCTCCA 


OCC C TCCOCC 


CAOCOAGACC 


GTC ACCTGCA 


ACGT TOCCCA 


CCCOOCCAOC 


•7 10 


AO C ACC AAGO 


TOOAC AAGAA 


AATTGTOCCC 


AGGG AT TOT A 


CTAOTTACCC 


OT ACGACOTT 


7 8 0 


CCOOACT ACQ 


GTTCTT AA 










7 9 8 



( 2 ) INFORMATION FOR SBQID NO: 100: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 194 base pair* 
( B ) TYPE: n uc l eic tad 
( C ) STRANDEDNESS: double 
( D > TOPOLOGY: tinea 

< i i > MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i > SEQUENCE DESCRIPTION: SBQ ID NO:10Ch 
TGAATTCTAA ACTAGTCGCC AAGOAOACAO TCATAATOAA ATACCTATTG CCTACOOCAG 
CCOCTOOATT OTTACTCOCT GCCCAACCAO CCATGGCCGA GCTCCAGATG ACCCAOTCTC 
CAOCCTCCCT ATCTOCATCT OTOOOAOAAA CTOTCACCAT CACATOTCOA TCAAOTOAOA 
ATATTACAAT TACT 

< 2 ) INFORMATION FOR SEQ ID NO: 10 1: ~ 

( i ) SEQUENCE CHARACTERISTICS: , 

< A ) LENGTH: 333 bme pair» 
( B ) TYPE: nucleic gcid . 

< C ) STRANDEDNESS: double 
( D ) TOPOLOGY: H*«r 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i j i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

<x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 



CTOATOCTOC 


ACC A AC TO T A 


TCCATCTTCC 


C ACC ATCC AO 


TOAOCAOTTA 


AC AT CTOOAG 


6 0 


OTOCCTCAGT 


COTGTOCTTC 


TTOAACA ACT 


TCTACCCCAA 


AOACT ACA AT 


OTCAAOOOGA 


1 2 0 


AG AT TO ATOG 


C AGTOAACG A 


CA AAATGOCO 


TCCTOAACAG 


TTOOACTGAT 


CAOOACAOCA 


1 * 0 


AAOAC AOCAC 


CTACAOCATO 


AO C AOCAC CC 


TCACOTTOAC 


CAAGG ACOAG 


TATOAACOAC 


2 4 0 


AT AACAOCT A 


T ACCTGTOAT 


OCCACTCACA 


AOACATC AAC 


TTC ACCC ATT 


OTCA AOAGCT 


3 0 0 


TCAACAOOAA 


TOAOTGTT AA 


T T C TAG A C GO 


CGC 






3 3 3 



( 2 ) INFORMATION FOR SEQ ID NO: 102: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 150 booe pair* 
( B ) TYPE: nnckic cad 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: fine** 

( i i ) MOLECULE TYPE: DNA toewmk) 



( i i i ) HYPOTHETICAL: NO 
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(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:102: 
GCTOAGOGTG ACGATCCCGC AAAAGCGGCC TTTAACTCCC TOCAAGCCTC AOCOACCOAA 
t AT ATCOOTT atocgtoooc GATGOTTGTT OTCATTOTCO GCOCAACTAT CGGTATCAAG 
CTOTTTAAGA AATTCACCTC OAAAGCAAGC 



( 2 ) INFORMATION FOR SEQ ID NO:103: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 36 bait pairs 
( B ) TYPE: nucleic acid 
{ C ) STRANDEDNESS: ibvgJc 
( D ) TOPOLOGY: finear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

< i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ BO NO--103: 
OTGCCCAGOG ATTOTACTAO TOCTOAGOOT G AC GAT 



( 2 ) INFORMATION FOR SEQ ID NO: 104: 

< j ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 baae pairs 
( B ) TYPE: nuciek acid 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: &ne« 

( i i ) MOLECULE TYPE: DNA (genomic) 



( i i i ) HYPOTHETICAL: NO 
(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: J 04: 
ACTCOAATTC TATCAOCTTO CTT TCOAOGT G A A 

( 2 ) INFORMATION FOR SEQ ED NO: 105: 



3 3 



( t ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pan 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: &near 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ED NO: 105: 
AGOTCCAGCT TCTCGAGTCT GO 



{ 2 ) INFORMATION FOR SEQ ID NO: 106: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 baae pain 
( B ) TYPE: oudek acid 
( C ) STRANDEDNESS: tingle 
• ( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 
(lit ) HYPOTHETICAL: NO 
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(tv) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO. 106: 
OTCACCCTCA GCACTAOTAC AATCCCTOOO CAC 



( 2 ) INFORMATION FOR SEQ ID NO: 107: 

< t ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 48 base pain 
( B ) TYPE: nucleic acid 
< C ) STRANDEDNESS: sinajc 
( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iit ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x t ) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
OAOACOACTA GTGOTOOCOO TOOCTCTCCA TTCOTTTOTG AATATCAA 



( 2 ) INFORMATION FOR SEQ ID NO: 108: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 40 base pa* 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: smgje 
( D ) TOPOLOGY: hoc* 

( i i ) MOLECULE TYPE: DNA (gcocanc) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ H> NO: 108: 
TTACTAOCTA OCATAATAAC O G A A T AC CCA AAAGAACTOG 

{ 2 ) INFORMATION FOR SEQ ID NO. 109: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 36 base pairs 
( B ) TYPE: mckic acid 
( C ) STRANDEDNESS: smgfe 
( D ) TOPOLOGY: haear 

( i i ) MOLECULE TYPE: DNA (geoMiik) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

{ x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
TATOCTAOCT AGTAACACOA CAOGTTTCCC OACTOG 



< 2 ) INFORMATION FOR SEQ ID NOMIO: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 baao pain 
( B ) TYPE: nockic acid 
( C ) STRANDEDNESS: tingle 
( D)TOPOUX>Y:b»r 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i ▼ ) ANTI-SENSE: NO 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NOtllfc 
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ACCGAGCTCO AATTCGTAAT CATGGTC 

( 2 )INPORMAnONFORSEQIDNO:lIl: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 186 base paii 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iii ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOlll: 
GTOCCCAOOO ATTGTACTAO TOCTOAOOOT OACGATCCCO CAAAAGCOGC CTTTAACTCC 60 
CTOCAAGCCT CAGCOACCOA ATATATCGGT TATGCOTGOG CGATOGTTOT TGTCATTGTC 120 
OOCOCAACTA TCGGTATCAA OCTGTTTAAO AAATTCACCT CGAAAGCAAO CTGATAOAAT 180 
TCOAOT 186 



< 2 ) INFORMATION FOR SEQ ID NO: 112: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 666 boe pairs 
< B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: doable 
( D > TOPOLOOY: linen 

( i i ) MOLECULE TYPE: DNA (genomic) 

' ( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: It 2: 

CCATTCOTTT GTOAATATCA AGOCCAAOOC CAATCOTCTO ACCTOCCTCA ACCTCCTGTC 60 

AATGCTOOCO OCOOCTCTOO TGOTOOTTCT OOTOOCOOCT CTGAGOOTGG TGOCTCTOAO 120 

OOTGOCOOTT CTOAOGOTOO COGCTCTGAO OGAOOCOOTT CCGOTOOTOG CTCTOOTTCC ISO 

OOTOATTTTG AT TATOAAAA OATOOCAAAC OCTAATAAOO OOOCTATGAC COAAAATOCC 240 

OATGAAAACG COCTACAGTC TGACGCTAAA OOCAAACTTG ATTCTGTCGC TACTOATTAC 300 

OOTGCTOCTA TCGATOOTTT CATTGOTGAC OTTTCCOOCC TTOCTAATGG TAATOOTOCT 360 

ACTGOTOATT TTOCTOOCTC TAATTCCCAA ATOGCTCAAO TCOOTOACGO TOATAATTCA 420 

CCTTTAATOA AT AATTTCCO TCAATATTTA CCTTCCCTCC CTCAATCGGT TOAATOTCOC 

CCTTTTOTCT TTAOCOCTGO TAAACCATAT OAATTTTCTA TTGATTOTOA CAAAATAAAC 

TTATTCOGTG TCTTTGCOTT TCTTTTAT AT OTTOCCACCT TTATGTATGT ATTTTCTACO 

TTTOCTAACA TACTOCGTAA TAAOOAGTCT TAATCATOCC AOTTCTTT TO OGTATTCCOT 660 

6 6 6 

TATTAT 

( 2 ) INFORMATION POR SEQ ID NO: 113: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 708 toe pair* 
( B ) TYPE: nucleic «id 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: hnen 

( i i ) MOLECULE TYPE: DNA (gecotnic) 



4 8 0 

5 4 0 

6 0 0 



( i i i ) HYPOTHETICAL: NO 
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( i v ) ANT] -SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQID NO: 113: 



OAG ACG ACT A 


GTGGTGOCGO 


TOGCTCTCCA 


TTCOTTTGTG 


AATATCA AGO 


CCAAGOCCAA 


6 0 


TCOT CTOACC 


TGCCTCAACC 


TCCT GT C A AT 


GCTGGCGGCO 


OC TC TOO TOO 


TOGTTCTGOT 


1 2 0 


GGCOGCT CTO 


ACOOTGG TGG 


CTCTGAGOOT 


OGCGOT T CTG 


AGOGTOGCOO 


C T CTG AGGOA 


1 8 0 


GOCGOT T CCG 


OTOGTOGCTC 


TGGTTCCGGT 


GATTTTGATT 


ATO A A AAG A T 


OOCA AACGCT 


2 40 


A AT A AGOOOO 


CTATOACCOA 


AAATOCCGAT 


G AA A ACOCGC 


T AC AGTC TG A 


CO CT AAAOOC 


3 0 0 


AAACTTOATT 


CTOTCOCT AC 


TGATTACGGT 


GCTGCT ATCO 


ATOGTTTC AT 


TOGTGACOTT 


3 6 0 


TCCGOCCTTO 


C T A A TOOTA A 


TOG TOCT ACT 


OGTO ATTTTO 


CTGGCT CTA A 


T TCCCAAATG 


4 2 0 


GCTCA AOTCG 


OTG AC GGTGA 


T AATTC ACCT 


TT AATGAATA 


ATTTCCOTCA 


ATATTTACCT 


4 8 0 


TCCCTCCCTC 


AA TCGGTTGA 


ATOT COCCCT 


TTTOTCTTTA 


GCOCTOG T AA 


ACCAT ATOAA 


5 4 0 


TTTT CTATTG 


ATTOTGACAA 


AATAAACTTA 


TTCCOTOOTO 


TCTT TOCOTT' 


TCTTTTATAT 


6 00 


OTTOCC ACCT 


T T ATOT ATOT 


ATTTTCTACG 


TTTGCT AAC A 


T ACTOCOTAA 


TAAGGAGTCT 


6 6 0 


T AATCATOCC 


AGTTCTTTTG 


OOT AT TC CO T 


TATTATOCTA 


OCT AO T AA 




7 0 8 



6 0 



( 2 ) INFORMATION FOR SEQ ID NO: 114: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 201 b»e pan 
< B ) TYPE: nuckk acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: fioear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

(l i ) SEQUENCE DESCRIPTION: SEQ ID NO: 11 4c 
TATOCTAOCT AOTAACACGA CAGGTTTCCC GACTOOAAAO COOGCAOTOA OCOCAACOCA 
ATTAATOTGA GTTAOCTCAC TCATTAOGCA CCCCAOOCTT TACACTTTAT OCTTCCOGCT 120 
COTATOTTOT OTGOAATTOT OAOCOOATAA CAATTTCACA CAOGAAACAO CTATOACCAT 180 
OATTACOAAT TCOAOCTCOO T 101 

{ 2 ) INFORMATION FOR SEQ ID NO: 115: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: »0 base par* 
( B >TYPE: uclck acid 
( C > STRANDEDNESS: doabk 
( D ) TOPOLOGY: finear 

( i i ) MOLECULE TYPE: DNA {genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTVSENSE: NO 

( z i ) SEQUENCE DESCRIPTION: SEQ CD NO: 115: 
AGOTCCAOCT TCTCGAOTCT GOACCTOOCC TCGTOAAACC TTCTCAGTCT CTGTCTCTCA 60 
CCTOCTCTOT CACTOACTAC TCCATCACCA GTGCTTATTA CTOOAACTOO ATCCOGCAOT 120 
TTCCAOOAAA CAAAC TOOAA TOOATOOOCT ACATAAOCTA COACOOTOTC AATAAGTATG 180 
ATCCATCTCT CAAOAATCGA ATCTCCATCA CTCOTOACAC ATCTAACAAT CAOTTTTTCC 240 
AOAAOTTOAT TTCTGTOACT TCTOAOGACA CAOOAACATA TOACTOTTCA AOAOOOACTA 300 
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GOOCCTCTGC 


T ATOOACTAC 


TOOGGTCAAG 


GAAT TTCAGT 


C ACCOTCTCC 


TCAOCCAAAA 


3 6 0 


COAC ACCCCC 


ATCTOTC TAT 


CCACTGOCCC 


CTOG ATCTOC 


TGCCCA AACT 


AACTCCATGG 


4 2 0 


TOACCCTOOO 


ATOCCTOOTC 


AAOOGCTATT 


TCCCTOAGCC 


AOTOAC AOTG 


ACCTGGAACT 


4 8 0 


CTOOAT CCCT 


OTCCAOCOOT 


GTOCACACCT 


TCCC AOCTOT 


CCTOCAOTCT 


0 ACCTCTAC A 


5 4 0 


CTCTOAOCAO 


CTCAQTOACT 


OTCCCCTCCA 


GCCCTCGGCC 


CAGCOAO ACC 


GTCACCTGC A 


6 0 0 


ACOTTOCCCA 


CCCOOCCAOC 


AGCACCAAOG 


TOG ACAAGA A 


A ATTGTGCCC 


AOGOATTGTA 


6 6 0 


CT AOTOCTO A 


OOOTOACOAT 


CCCOCAAAAO 


COGCCTTTAA 


CT CCCTGC AA 


GCC T C AGCO A 


7 2 0 


CCGAAT AT AT 


COGTT ATOCO 


TGGOCOATOO 


TTGT TOTCAT 


T OTCOGC OCA 


ACT ATCGOT A 


7 8 0 


TCAAOCTOTT 


T AAOAAATTC 


ACC TCGAAAO 


CAAOCTGATA 


GAATTCGAGT 




8 3 0 



2 6 0 



t 2 ) INFORMATION FOR SBQ ID NCkn6: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 260 base pairs 
( B ) TYPE: nucleic acid 
f C ) STRANDEDNESS: double 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iii ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:116: 
ATGAAATACC TATTGCCTAC OGCAGCCOCT GOATTOTTAT TACTCGCTOC CCAACCAOCC 60 
ATGOCCCAGO TOAAACTGCT COAGATTTCT AOACTAOTOC TOAOOOTOAC GATCCCOCAA J 20 

AAGCOOCCTT TAACTCCCTO CAAGCCTCAG COACCOAATA TATCGOTTAT GCOTOOGCGA 180 
TOGTTOTTOT CATTOTCGOC GCAAC TATCO OTATCAAGCT OTTTAAGAAA TTCACCTCGA 240 
AAOCAAOCTO ATAGAATTCG 

( 2 ) INFORMATION FOR SBQ ID NO: 1 17: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: <6i ho»o pair* 
( B ) TYPE: nucleic tad 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: Bneix 

( i i ) MOLECULE TYPE: DNA (genomic) 

(tii ) HYPOTHETICAL: NO 

( i ▼ ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SBQ ID NO:] 17: 

OTACOCGCCC TGTAOCOOCG CAT TAAGCOC GGCOGOTOTO OTOOTTACGC GCAOCOTGAC 

COCTACACTT GCCAOCGCCC TAGCOCCCOC TCCTTTTGCT TTCTTCCCTT CCTTTCTCGC 

CACOTTCOCC GGCTTTCCCC OTCAAOCTCT AAATCOOOOG CTCCCTTTAO GGTTCCOATT 180 

IAOTOCTTTA COOCACCTCO ACCCCAAAAA ACT TOAT TAG OOTOATOOTT CACGTAOTOO 240 

OCCATCOCCC TOATAGACOO TTTTTCGCCC TTTOACOTTO OAGTCCACGT TCTTTAATAO 300 

TOOACTCTTG TTCCAAACTG GAACAACACT CAACCCTATC TCOOTCTATT CTTTTOATTT 360 

ATAAOOOATT TTOCCOATTT COOCCTATTO GTT A AAA AAT OAOCTOATTT AACAAAAATT 420 

TAACGCOAAT TTTAACAAAA TATTAACOTT TACAATTTAA A 



6 0 
2 0 



4 6 1 
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< 2 ) rhTORMATlON FOR SEQ ID N0:11S: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: lingfe 
( D ) TOPOLOGY: line* 

( i i ) MOLECULE TYPE: DNA (genomic) 

< i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:ll8: 
OCAATAAACC CTCACTAAAO GO 



( 2 ) INFORMATION FOR SEQ ID N0:119: 

( k > SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 21 base pain 
( B ) TYPE: aoclck acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

< i i ) MOLECULE TYPE: DNA (genomic) 

( k i i ) HYPOTHETICAL: NO 

( iv ) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 9: 
TCTCOCACAA TAATACACOO C 



( 2 ) INFORMATION FOR SEQ ID NO- 120: 

{ i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH : M base pem 
( B ) TYPE: pockac acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i y ) ANTI-SENSE: NO 

( s i ) SEQUENCE DESCRIPTION: SEQ ID NO:120: 
OTOTATTATT QTOCGAOANN SNNSNNSNNS NNSNNSNNSN NSNNSNNSNN SNNSNNSNNS 
NNSNNSTOGG 0CCAAG00AC CACO 



( 2 ) INFORMATION FOR SEQ ID NO: 12 1 : 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base panm 
{ B ) TYPE: mxidc acid 
( C } STRANDEDNESS: tingle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:121: 
OCATOTACTA OTTTTOTCAC AAOATTTGOO 



( 2 ) INFORMATION FOR SEQ ID NO:l22: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 18 base pair) 
( B ) TYPE: mckic acid 
( C ) STRANDEDNESS: imgte 
( D ) TOPOLOGY: Hntusr 

< i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

' ( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:122: 

TOGOGCCAAG GGACCACO 



( 2 ) INFORMATION FOR SEQ ID NO:123: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 1ft base pain 
( B ) TYPE: oodetc and 
( C ) STRANDEDNESS: imjje 
( D ) TOPOLOOY: Hnear 

( i j ) MOLECULE TYPE: DNA (genomic) 

< i i i ) HYPOTHETICAL: NO 

( i » ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:123: 
GTGTATTATT GT GC O AG A 



( 2 ) INFORMATION FOR SEQ ID NO-.124: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 39 base pain 
( B ) TYPE: nucleic acid 
{ C ) STRANDEDNESS: single 
( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( i x ) FEATURE: 

( A ) NAME/KEY: repcat_xcgk» 
( B ) LOCATION: 19. 21 

( D ) OTHER INFORMATION: /ipt_iypca*tandem** 

/ •Dte3 H NNS can be repeated &om 3 to about 24 

( i i ) SEQUENCE DESCRIPTION: SEQ CD NO: 124: 
GTGTATTATT OTGCGAOANN STOOGOCCAA OOGACCACO 



( 2 ) INFORMATION FOR SEQ tD NO: 125: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 39 base pain 
( B ) TYPE: oucktc acid 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i y ) ANTI-SENSE: NO 

( i x ) FEATURE: 

( A ) NAME/KEY: repeat_j*gio«, 
( B ) LOCATION: 19.21 

( D ) OTHER INFORMATION: *pt_Jype*"taiidenr 
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/ oote^NNK can be repealed from 3 to about 24 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:125: 
GTGTATTATT GTCCOAOANN KTOGGGCCAA GOGACCACG 



3 9 



What is claimed is: 10 
1. A method for producing a library of complementarity 
determining region (CDR)-rautagenized phage-displayed 
immunoglobulin heterodimers. the method comprising the 
steps of: 

1) amplifying a CDR portion of a template immunoglo- 15 
bulin variable domain gene selected from the group 
consisting of a template immunoglobulin heavy chain 
variable domain gene and a template immunoglobulin 
light chain variable domain gene, wherein said tem- 
plate immunoglobulin heavy and light chain genes ^ 
have a framework region and said CDR portion and 
encode respective heavy and light chain variable 
domain polypeptides, and wherein said amplifying is 
by polymerase chain reaction (PCR) using a PCR 
primer oligonucleotide for mutagenizing a preselected 25 
nucleotide region in said CDR portion, thereby forming 

a library of amplified CDR-rautagenized immunoglo- 
bulin gene fragments, said PCR primer oligonucleotide 
having 3* and 5' termini and comprising: 

a) a nucleotide sequence at the 3' terminus capable of ^ 
hybridizing to a first framework region of said 
selected template immunoglobulin variable domain 
gene; 

b) a nucleotide sequence at the 5' terminus capable of 
hybridizing to a second framework region of said 35 
selected template immunoglobulin variable domain 
gene; and 

c) a nucleotide sequence between the 3' and 5' termini 
according to the formula selected from the group 
consisting of: 40 

[NNSU and [NNKL,. 
wherein N is independently any nucleotide. S is G or C and 
K is G or T, and d is 3 to about 24. the 3' and 5' terminal 
nucleotide sequences having a length of about 6 to 50 45 
nucleotides, or an oligonucleotide having a sequence 
complementary thereto; 

2) inserting individual members of the library of amplified 
CDR-mutagenized immunoglobulin gene fragments 
formed in step (1) into a dicistronic phagemid expres- 50 
sion vector comprising immunoglobulin heavy and 
light chain variable domain genes that lack the immu- 
noglobulin gene portion corresponding to the fragment . 
to be inserted, wherein upon insertion said vector is 
capable of expressing heavy and light chain variable 55 
domain polypeptides encoded by said vector, thereby 
forming a library of dicistronic expression vectors 
containing amplified CDR-mutagenized immunoglo- 
bulin gene fragments; and 

3) expressing said immunoglobulin heavy and light chain 60 
genes in the library of dicistronic expression vectors 
formed in step (2) whereby said encoded heavy and 
light chain variable domain polypeptides assemble on 
the surface of a phage to form a phage-displayed 
immunoglobulin heterodimer. thereby producing a 65 
library of CDR-mutagenized phage-displayed immu- 
noglobulin heterodimers. 



2. The method of claim 1 wherein said 3' terminus has the 
nucleotide sequence 5'-TGGGGCCAAGGGACCACG-3* 
(SEQ ID NO 122). or an oligonucleotide having a sequence 
complementary thereto. 

3. The method of claim 1 wherein said 5' terminus has the 
nucleotide sequence 5' - GTGTATTATTGTGCG AG A-3' 
(SEQ ID NO 123) or an oligonucleotide having a sequence 
complementary thereto. 

4. The method of claim 1 wherein said template immu- 
noglobulin heavy and light chain genes are obtained from a 
human. 

5. The method of claim 1 wherein said CDR portion is 
CDR3. 

6. The method of claim 1 wherein said formula is: 

5 1 -GTrrrATTATTGTGCG AG A |NNS ]„TGGGGCCAAGGGAC - 
CACG-r (SEQ ID NO 124). 

7. The method of claim 1 wherein n is 16 in the formula 
|NNS] n (SEQ ID NO 120). 

8. The method of claim 1 wherein said formula is: 

5-OTtrrAriArrGTCK:GAGAf>rra 

CACG-3* (SEQ ID NO 1 25). 

9. A method for producing a complementarity determin- 
ing region (CDR-mutagenized phage-displayed immuno- 
globulin heterodimer with altered antigen binding 
specificity, the method comprising the steps of: 

1) amplifying a CDR portion of a template immunoglo- 
bulin variable domain gene selected from the group 
consisting of a template immunoglobulin heavy chain 
variable domain gene and a template immunoglobulin 
light chain variable domain gene, wherein said tem- 
plate immunoglobulin heavy and light chain genes 
have a framework region and said CDR portion and 
encode respective heavy and light chain variable 
domain polypeptides having a preselected antigen 
binding specificity to a first antigen, and wherein said 
amplifying is by polymerase chain reaction (PCR) 
using a PCR primer oligonucleotide for mutagenizing a 
preselected nucleotide region in said CDR portion to 
alter said preselected antigen binding specificity, 
thereby forming a library of amplified CDR- 
mutagenized immunoglobulin gene fragments, said 
PCR primer oligonucleotide having 3' and 5' termini 
and comprising: 

a) a nucleotide sequence at the 3' terminus capable of 
hybridizing to a first framework region of said 
selected template immunoglobulin variable domain 
gene; 

b) a nucleotide sequence at the 5* terminus capable of 
hybridizing to a second framework region of said 
selected template immunoglobulin variable domain 
gene; and 

c) a nucleotide sequence between the 3* and 5' terniini 
according to the formula selected from the group 
consisting of: 

[NNSL aoJ fNNKU 
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wherein N is independently any nucleotide. S is G or C and 
K is G or T. and n is 3 to about 24. the 3' and 5* terminal 
nucleotide sequences having a length of about 6 to 50 
nucleotides, or an oligonucleotide having a sequence 
complementary thereto; 5 

2) inserting individual members of the library of amplified 
CDR-mutagenized immunoglobulin gene fragments 
formed in step (1) into a dicistronic phagemid expres- 
sion vector comprising immunoglobulin heavy and 
light chain variable domain genes that lack the immu- 
noglobulin gene portion corresponding to the fragment 
to be inserted, wherein upon insertion said vector is 
capable of expressing heavy and light chain variable 
domain polypeptides encoded by said vector, thereby 
forming a library of dicistronic expression vectors 
containing amplified CDR-mutagenized immunoglo- 
bulin gene fragments; 

3) expressing said immunoglobulin heavy and light chain 
genes in the library of dicistronic expression vectors 
formed in step (2) whereby said encoded heavy and 
light chain variable domain polypeptides assemble on 
the surface of a phage to form a phage-displayed 
immunoglobulin heterodimer, thereby producing a 
library of CDR-mutagenized phage-displayed immu- 
noglobulin heterodimers; and 

4) immunoreacting members of the library of CDR- 
mutagenized phage-displayed immunoglobulin het- 
erodimers formed in step (3) on a preselected second 
antigen, said second antigen being different than said 
first antigen to allow for selection of a CDR- 
mutagenized phage-displayed immunoglobulin het- 
erodimer with altered antigen binding specificity. 

10. The method of claim 9 wherein said 3* terminus has 
the nucleotide sequence 5'-TGGGGCCAAGGGACCACG- 
3* (SEQ ID NO 122). or an oligonucleotide having a 
sequence complementary thereto. 

11. The method of claim 9 wherein said 5' terminus has 
the nucleotide sequence 5 f -GTGTATTArTGTGCGAGA-3' 
(SEQ ID NO 123) or an oligonucleotide having a sequence 
complementary thereto. 

1Z The method of claim 9 wherein said template immu- 
noglobulin heavy and light chain genes are obtained from a 
human. 

13. The method of claim 9 wherein said CDR portion is 
CDR3. 

14. The method of claim 9 wherein said formula is 

5' ^nXTEATTATICTGCG AG A(NNS kTGGGGCC AAGGG AC - 
CACG-3* (SEQ ID NO 124). 

15. The method of claim 9 wherein n is 16in the formula 
INNS)„ (SEQ ID NO 120). 

16. The method of claim 9 wherein said formula is: 

5'<)TGTXnxrrGTCCGAGA(NNK^ 
CACG-3' (SEQ ID NO 125). 

17. A method for producing a soluble complementarity 
determining region (CDR>mutagenized immunoglobulin 
heterodimer with altered immunoreactivity to a preselected 
antigen, the method comprising the steps of: 

1) amplifying a CDR portion of a template immunoglo- 
bulin variable domain gene selected from the group 
consisting of a template immunoglobulin heavy chain 
variable domain gene and a template immunoglobulin 
light chain variable domain gene, wherein said tem- 
plate immunoglobulin heavy and light chain genes 
have a framework region and said CDR portion and 
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encode respective heavy and light chain variable 
domain polypeptides immunoreactive with a prese- 
lected antigen, and wherein said amplifying is by 
polymerase chain reaction (PCR) using a PCR primer 
oligonucleotide for mutagenizing a preselected nucle- 
otide region in said CDR portion to alter said immu- 
noreactivity of said immunoglobulin heterodimer to the 
preselected antigen, thereby forming a library of ampli- 
fied CDR-mutagenized immunoglobulin gene 
fragments, said PCR primer oligonucleotide having 3* 
and 5* termini and comprising: 

a) a nucleotide sequence at the 3' terminus capable of 
hybridizing to a first framework region of said 
selected template immunoglobulin variable domain 
gene; 

b) a nucleotide sequence at the 5* terminus capable of 
hybridizing to a second framework region of said 
selected template immunoglobulin variable domain 
gene; and 

c) a nucleotide sequence between the 3' and 5* termini 
according to the formula selected from the group 
consisting of; 

[NNSU and fNNKU, 
wherein N is independently any nucleotide. S is G or C and 
K is G or T. and n is 3 to about 24. the 3* and 5* terminal 
nucleotide sequences having a length of about 6 to 50 
nucleotides, or an oligonucleotide having a sequence 
complementary thereto; 

2) inserting individual members of the library of amplified 
CDR-mutagenized immunoglobulin gene fragments 
formed in step (1) into a dicistronic phagemid expres- 
sion vector comprising immunoglobulin heavy and 
light chain variable domain genes that lack the immu- 
noglobulin gene portion corresponding to the fragment 
to be inserted, wherein upon insertion said vector is 
capable of expressing heavy and light chain variable 
domain polypeptides encoded by said vector, thereby 
forming a library of dicistronic expression vectors 
containing amplified CDR-mutagenized immunoglo- 
bulin gene fragments; 

3) expressing said immunoglobulin heavy and light chain 
genes in the library of dicistronic expression vectors 
formed in step (2) whereby said encoded heavy and 
light chain variable domain polypeptides assemble on 
the surface of a phage to form a phage-displayed 
immunoglobulin heterodimer. thereby producing a 
library of CDR-mutagenized phage-displayed immu- 
noglobulin heterodimers; 

4) immunoreacting members of the library of CDR- 
mutagenized phage-displayed immunoglobulin het- 
erodimers produced in step (5) on said preselected 
antigen to isolate an immunoglobulin heterodimer hav- 
ing altered immunoreactivity; 

5) isolating said immunoreacted CDR-mutagenized 
phage-displayed immunoglobulin heterodimer 
obtained in step (4); 

6) producing a soluble form of said immunoreacted form 
of immunoreacted CDR-mutagenized phage-displayed 
immunoglobulin heterodimer isolated in step (5); and 

60 7) assaying said soluble form of immunoreacted CDR- 
mutagenized immunoglobulin heterodimer prepared in 
step (6) to identify a CDR-mutagenized immunoglo- 
bulin heterodimer with altered immunoreactivity to the 
preselected antigen. 
65 18. The method of claim 17 wherein said assaying is 
determined by an increase in affinity to the preselected 
antigen. 
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19. The method of claim 18 wherein the affinity is greater 
than lO^M" 1 dissociation constant (Kj). 

20. The method of claim 17 wherein said 3* terminus has 
the nucleotide sequence 5*-TGGGGCCAAGGGACCACG- 
J (SEQ ID NO 122), or an oligonucleotide having a 
sequence complementary thereto. 

21. The method of claim 17 wherein said 5* terminus has 
the nucleotide sequence 5* -C^<JTAXt fiTT(JTGCG AG 
(SEQ ID NO 123) or an oligonucleotide having a sequence 
complementary thereto. 

22. The method of claim 17 wherein said template immu- 
noglobulin heavy and light chain genes are obtained from a 
human. 
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23. The method of claim 17 wherein said CDR portion is 
CDR3. 

24. The method of claim 17 wherein said formula is 

S'-GTGTATTXrTGTGCGAC A|NNS |„TGGGGCC AAGGGAC- 
CACG-3' (SEQ ED NO 124). 

25. The method of claim 17 wherein n is 16in the formula 
INNS),, (SEQ ID NO 120). 

26. The method of claim 17 wherein said formula is: 

5 '-GTGTATTATTGTGCGAGATNNK ] „TGGGGCC AAGGG AC- 
. CACG-3' (SEQ ID NO 125). 

***** 
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T cell receptors (TCRs) exhibit genetic and structural diversity 
similar to antibodies, but they have binding affinities that are 
several orders of magnitude lower. It has been suggested that TCRs 
undergo selection in vivo to maintain lower affinities. Here, we 
show that there is not an inherent genetic or structural limitation 
on higher affinity. Higher-affinity TCR variants were generated in 
the absence of in vivo selective pressures by using yeast display 
and selection from a library of Va CDR3 mutants. Selected mutants 
had greater than 100-fold higher affinity (Kb * 9 nM) for the 
peptide/MHC ligand while retaining a high degree of peptide 
specificity. Among the high-affinity TCR mutants, a strong prefer- 
ence was found for CDR3a that contained Pro or Gly residues. 
Finally, unlike the wild-type TCR, a soluble monomeric form of a 
high-affinity TCR was capable of directly detecting peptide/MHC 
complexes on antigen-presenting cells. These findings prove that 
affinity maturation of TCRs is possible and suggest a strategy for 
engineering TCRs that can be used in targeting specific peptide/ 
MHC complexes for diagnostic and therapeutic purposes. 

T cells recognize a foreign peptide bound to the MHC product 
through the ajS heterodimeric receptor. The T cell receptor 
TCR) repertoire has extensive diversity created by the same 
,ene rearrangement mechanisms used in antibody heavy- and 
light-chain genes (1). Most of the diversity is generated at the 
junctions of V and J (or diversity, D) regions that encode the 
complementarity-determining region three (CDR3) of the a and 
j3 chains (2). However, TCRs do not undergo somatic point 
mutations as do antibodies, and perhaps not coincidentally, 
TCRs also do not undergo the same extent of affinity maturation 
as antibodies. TCRs appear to have affinities that range from 10 5 
to 10 7 M" 1 whereas antibodies have affinities that range from 10 5 
to 10 10 M" 1 (3, 4). 

Whereas the absence of somatic mutation in TCRs may be 
associated with lower affinities, it has also been argued that there 
is not a selective advantage for a TCR to have higher affinity 
(5-7). In fact, the serial-triggering (6) and kinetic proofreading 
(7) models of T cell activation both suggest that very slow 
off-rates (associated with higher affinity) would be detrimental 
to the signaling process. On the other hand, the fastest off-rates 
that have been measurable have been associated with altered 
pMHC that exhibits antagonist activity (8-11). Whereas the 
narrow range of natural TCR affinities has provided some 
evidence for the relationships between off-rates and agonist/ 
antagonist activity, there are also examples that appear to be 
inconsistent with these hypotheses (12, 13). 

There are other possible explanations for why the T cell system 
maintains relatively low TCRrpMHC affinities in vivo. Peptides 
bound within the MHC groove display limited accessible surface 
(14), which may in turn limit the amount of free energy that can 
be generated in the interaction. On the other hand, raising the 
affinity of a TCR by directing the free energy toward the MHC 
lelices: would presumably lead to thymic deletion during nega- 
tive selection (15). Even if such higher-affinity TCR could escape 
thymic deletion, they would likely not maintain the peptide 
specificity required for T cell responses. 



It has not been possible to directly test these possibilities 
because the generation of TCRs with affinities above 10 7 M" 1 
has not been accomplished. In addition to allowing a kinetic 
basis of T cell triggering, high-affinity TCRs could be used to 
more easily explore the role of peptide in pMHC recognition, 
and as quantitative probes for the expression of pMHC on 
various target cells. Because in vivo selection schemes have not 
yielded TCRs with the intrinsic binding affinities of affinity- 
matured antibodies, in this report, we have used an in vitro 
method for the directed evolution of high-affinity TCRs. The 
method relies on the expression of a library of mutant single- 
chain (V/3-linker-Va) TCRs on the surface of yeast, as a fusion 
to the surface protein Aga-2 (16, 17). Our previous'studies have 
shown that the yeast display system could be used to engineer 
variants of the 2C single-chain TCR (scTCR) that were more 
thermally stable and secreted at higher levels (17, 18). The 
stability mutants were isolated by subjecting the entire TCR gene 
to random mutagenesis and selecting for increased surface levels 
with anti-TCR antibodies (17). The mutations that increased 
stability resided at the Va: V/3 interface or on the outside surface 
of V/3 in a region not involved in pMHC binding. To isolate TCR 
with higher affinity for pMHC, in the present study, we mutated 
only the CDR3a loop, which is at the center of the pMHC- 
binding site (19). Our efforts were guided by previous findings 
that this region contributed minimal binding free energy to the 
interaction of the 2C TCR with the pMHC ligand QL9/L d (20), 
suggesting that productive interactions might be improved by 
focusing on this region. Remarkably, selection from a relatively 
small library (10 5 mutants) yielded many different TCRs with up 
to 100-fold increased affinity for QL9/IA The high-affinity 
TCRs retained a high degree of peptide specificity although 
there was some variation in fine specificity among the mutants. 
These findings suggest that the in vitro evolution process de- 
scribed here can be used to isolate TCRs with specificities that 
one defines by selection with appropriate pMHC ligands. 

The high-affinity receptors in this study were derived by 
variation at the VJ junction, the same process that operates very 
effectively in vivo through gene rearrangements in T cells (2). 
The fact that we could readily isolate a diverse set of high-affinity 
TCR in vitro indicates that there is not a genetic or structural 
limitation to high-affinity receptors. This supports the view that 
inherently low affinities of TCRs found in vivo are caused by a 
lack of selection for higher affinity and perhaps a selection for 
lower affinity (5-7). Finally, the high-affinity TCR were used in 
monomeric form to detect pMHC on the surface of target cells, 
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Fig. 1. Flow cytometric analysis of yeast cells that express wild-type and 
mutant 2C TCR on their surface. Yeast cells displaying wild-type (T7) and 
mutant (m6 and m13) scTCR were stained with anti-V08 antibody F23.2 
(120 nM), the specific alloantigenic peptide-MHC, QL9/L d /lg (40 nM), or a null 
peptide MCMV/L d /lg (40 nM). The peptides used in this study were QL9 
(QLSPFPFDL), MCMV (YPHFMPTNL), and p2Ca (LSPFPFDL). Binding was de- 
tected by FITC-conjugated goat anti-mouse IgG F(ab'>2 and analyzed by flow 
cytometry. The negative population (e.g., seen with F23.2 staining) has been 
observed for all yeast-displayed proteins and is thought to be caused by cells 
at a stage of growth or induction that are incapable of expressing surface 
fusion protein (16, 17, 27). 

indicating that soluble forms of the TCR selected with the yeast 
display system can serve as probes for tumor-associated pMHC 
or other T cell-specific ligands. 

Materials and Methods 

Library Construction. The 2C single-chain TCR (scTCR) used as 
the scaffold for directed evolution (T7) contained six mutations 
(0G17E, 0G42E, J3L81S, aL43P, aW82R, and oI118N) that 
have been shown to increase the stability of the TCR but still 
allow pMHC binding (E.V.S., K.D.W., and D.M.K., unpublished 
results; and ref. 18). Mutagenic PCR of the T7 scTCR VaCDR3 
was performed by using an AGA-2-specific upstream primer and 
a degenerate downstream primer 5 ' - CTTTTGTGCCGGATC- 
CAAATGTCAG(SNN) 5 GCTCACAGCACAGAAGTACACG- 
GCCGAGTCGCTC-3'. Underlined bases indicate the positions 
of silent mutations introducing unique BamHl and Eagl restric- 
tion sites. The purified PCR product was digested with Ndel and 
BamHl and ligated to Ata?I-.&zmHI-digested T7/pCT302 (16- 
18). The ligation mixture was transformed into DH10B electro- 
competent Escherichia coli (GIBCO/BRL), and transformants 
were pooled into 250-ml LB supplemented with ampicillin at 
100 pig/ml and grown overnight at 37°C. Plasmid DNA was 
transformed into the yeast strain EB Y100 by the method of Gietz 
and Schiestl (21). 

v Cell Sorting. The yeast library (22) was grown in 2% dextrose/ 
167% yeast nitrogen base/1% Casamino acids (Difco) at 30°C 
to an OD600 = 4.0. To induce surface scTCR expression, yeast 
were pelleted by centrifugation, resuspended to an ODgoo = 1.0 
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Fig. 2. Structure and sequences of the 2C TCR CDR3a. (4) X-ray crystal lo- 
graphic structure of the 2C/dEV8/K b complex with CDR3a aa highlighted. Five 
residues of the 2C VaCDR3 that were randomized by PCR with a degenerate 
primer are shown in red. The adjacent CDR3 residues, Ser-93 and Leu-104 
shown in blue, were retained in the yeast display library because they have 
been shown to be important in pMHC binding (17, 18, 20). (B) Alignment of aa 
sequences of mutant scTCRs isolated by yeast display and selection with 
QL9/L d . Display plasmids were isolated from yeast clones after selection and 
sequenced to determine CDR3a sequences. Mutants ml, m2, m3, m4, m10, 
and ml 1 were isolated after the third round of sorting. All other mutants were 
isolated after the fourth round of sorting. 

in 2% galactose/0.67% yeast nitrogen base/1% Casamino acids, 
and incubated at 20°C for ~24 h. In general, «*10 7 cells per tube 
were incubated on ice for 1 h with 50 /il of QL9/L d /IgG dimers 
(23) diluted in PBS (pH 7.4) supplemented with 0.5 mg/ml BSA. 
After incubation, cells were washed and labeled for 30 min with 
FITC-conjugated goat anti-mouse IgG F(ab')2 (Kirkegaard & 
Perry) in PBS (pH 7.4) supplemented with 0.5 mg/ml BSA. 
Yeast were then washed and resuspended in PBS (pH 7.4) 
supplemented with 0.5 mg/ml BSA immediately before sorting. 
Cells exhibiting the highest fluorescence were isolated by using 
a Coulter 753 bench fluorescence-activated cell sorter. After 
isolation, sorted cells were expanded in 2% dextrose/0.67% 
yeast nitrogen base/1% C^amino acids and induced in 2% 
galactose/0.67% yeast nitrogen base/1% Casamino acids for 
subsequent rounds of selection. A total of four sequential sorts 
were performed. The concentrations of QL9/L d /IgG dimers 
used for staining were 50 Mg/ml for sorts one to three and 
0.5 ptg/ml for the final sort. The percentages of total cells 
isolated from each sort were 5.55, 2.68, 2.56, and 0.58%, 
respectively. Aliquots of sorts three and four were plated on 2% 
dextrose/0.67% yeast nitrogen base/1% Casamino acids to 
isolate individual clones which were analyzed by flow cytometry 
by using a Coulter Epics XL instrument. 

Soluble scTCR Production. The T7 and m6 scTCR genes were 
excised from pCT302 Nhel-Xhol and ligated into Nhel-Xhol 
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digested pRSGALT, a yeast expression plasmid (18). Ligations 
were transformed into DH10B electrocompetent E. coli 
(GIBCO/BRL). Plasmid DNA was isolated from bacterial cul- 
tures and transformed into the Saccharomyces cerevisiae strain, 
BJ5464 (a ura3-52 trpl leu2M his3A200 pep4:JfIS3 prblM6R 
canl GAL) (18). Yeast clones were grown in 1 liter of 2% 
dextrose/0.67% yeast nitrogen base/1% Casamino acids/20 
mg/liter Trp for 48 h at 30°C. To induce scTCR secretion, cells 
were pelleted by centrifugation at 4,000 x g, resuspended in 1 
liter of 2% galactose/0.67% yeast nitrogen base/1% Casamino 
acids/20 mg/1 Trp supplemented with 1 mg/ml BSA, and 
incubated for 72 h at 20°C. Culture supernatants were harvested 
by centrifugation at 4,000 x g, concentrated to ^50 ml, and 
dialyzed against PBS (pH 8.0). The six His-tagged scTCRs were 
purified by native nickel affinity chromatography [Ni-NTA 
Superflow, Qiagen (Chatsworth, CA); 5 mM and 20 mM imi- 
dazole (pH 8.0) wash; 250 mM imidazole elution] (18). 

Cell-Binding Assays. The binding of soluble scTCRs to QL9/L d 
was monitored in a competition format as described (20, 24). 
Peptide-loaded T2-L d cells (3 x 10 5 per well) were incubated for 
1 h on ice in the presence of 125 I-labeled anti-L d Fabs (30-5-7) 
and various concentrations of scTCRs. Bound and unbound 
[ 125 I] 30-5-7 Fabs were separated by centrifugation through olive 
oil/dibutyl phthalate. Inhibition curves were constructed to 
determine inhibitor concentrations yielding 50% maximal inhi- 
bition. Dissociation constants were calculated by using the 
formula of Cheng and Prusoff (25). To monitor direct binding of 
scTCRs to cell-bound pMHC, peptide-loaded T2-L d cells (5 x 
10 5 per tube) were incubated for 40 min on ice with biotinylated 
soluble scTCRs followed by staining for 30 min with streptavi- 
din-phycoerythrin (PharMingen). Cellular fluorescence was de- 
tected by flow cytometry. 

Results and Discussion 

To examine if it is possible to generate higher-affinity TCR that 
would retain peptide specificity, we subjected a TCR to a process 



of directed in vitro evolution. Phage display (26) has not yet 
proven successful in the engineering of single-chain TCRs 
(scTCRs, V/3-linker-Va) despite the extensive structural simi- 
larity between antibody and TCR V regions. However, we, 
recently showed that a scTCR could be displayed on the surface 
of yeast (17) in a system that has proven successful in antibody 
engineering (16, 27). A temperature-stabilized variant (T7) (18) 
of the scTCR from the cytotoxic T lymphocyte clone 2C was used 
in the present study. Cytotoxic T lymphocyte clone 2C recog- 
nizes the alloantigen L d with a bound octamer peptide called 
p2Ca, derived from the enzyme 2-oxoglutarate dehydrogenase 

(28) . The nonameric variant QL9 is also recognized by cytotoxic 
T lymphocyte 2C, but with 10-fold higher affinity by the 2C TCR* 

(29) . Alanine scanning mutagenesis showed that the CDR3a 
loop contributed minimal free energy to the binding interaction 
(20), even though structural studies have shown that CDR3a of 
the 2C TCR is near the peptide and it undergoes a conforma- 
tional change to accommodate the pMHC complex (19). Thus, 
we focused our mutagenesis efforts on five residues that form the 
CDR3a loop. 

A library of 10 5 independent TCR-CDR3a yeast mutants was 
subjected to selection by flow cytometry with a fluorescently 
labeled QL9/L d ligand (23). After four rounds pf sorting and 
growth, 15 different yeast colonies were examined for their 
ability to bind the ligand, in comparison to the scTCR variant T7, 
which bears the wild-type CDR3a sequence (Fig. 1 and data not 
shown). The anti-V/38.2 antibody F23.2 which recognizes resi- 
dues in the CDR1 and CDR2 was used as a control to show that 
wild-type scTCR-T7 and scTCR mutants (m6 and ml3, in Fig. 
1, and others, data not shown) each had approximately equiv- 
alent surface levels of the scTCR (Fig. 1). In contrast, the soluble 
QL9/L d ligand bound very well to each mutant yeast clone but 
not to wild-type scTCR-T7. The MCMV/L d complex, which is 
not recognized by cytotoxic T lymphocyte clone 2C, did not bind 
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Fig. 3. Fine specificity analysis of mutant scTCR binding to different QL9 variant peptides bound to L d .The original T cell clone 2C and various yeast clones were 
analyzed by flow cytometry for binding to L d /lg dimers loaded with wild-type QL9 (QL9-5F) ( position 5 variants of QL9 (QL9-5Y, QL9-5H, and QL9-5E) or MCMV. 
Binding was detected with FITC-labeled goat anti-mouse IgG. Relative fluorescence was measured by comparison with mean fluorescence values of 2C cells or 
yeast cells stained with anti-V08 antibody F23.2. 
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Fig. 4. QL9/L d binding by soluble scTCRs. T2-L* cells loaded with QL9 were 
incubated with ^-labeled anti-L* Fab fragments (30-5-7) and various 
concentrations of unlabeled Fab (□), scTCR-T7 (m), or mutant scTCR-m6 (©). 
Bound and unbound p"l]30-5-7 Fab fragments were separated by cen- 
tnfugation through olive oil/dibutyl phthalate. Binding of 12s Mabeled 
anti-LO Fab fragments to T2-L* cells loaded with the control peptide MCMV 
was not inhibited even at the highest concentrations of scTCRs (data not 
shown). 



to J5l ScTCR mutants or wild-type scTCR-T7, indicating that the 
scTCR mutants retained peptide specificity. 

The CDR3a sequences of the 15 mutants all differed from the 
2C TCR (Fig. 2). It was readily apparent (and confirmed by a 
blast alignment algorithm) that the sequences could be aligned 
uito two motifs. One motif contained Gly in the middle of the 
five residue stretch whereas the other motif contained three 
tandem Pro. Evidence that all three Pro are important in 
generating the highest affinity site is suggested by results with 
mutant mil. Mutant mil contained only two of the three Pro 
and exhibited reduced binding compared with the triple-Pro 
mutants (data not shown). The Gly-containing mutants ap- 
peared to have preferences for positive-charged residues among 
the two residues to the carboxyl side (7/9) and aromatic and/or 
positive-charged residues among the two residues to the amino 
side (4/9 and 5/9). The selection for a glycine residue at position 
102 in the motif may indicate that the CDR3a loop required 
conformational flexibility around this residue to achieve in- 
creased affinity. This is consistent with the large (6-A) confor- 
mational difference observed between the CDR3a loops of the 
hganded and unliganded 2C TCR (19). It is also interesting that 
Gly is the most common residue at the V(D)J junctions of 
antibodies and the presence of a Gly has recently been associated 
with increased affinity in the response to the (4-hydroxy-3- 
nitrophenyl) acetyl hapten (30). 

In contrast to the isolates that contain Gly, the selection for 
a Pro-rich sequence at the tip of the CDR3a loop may sugaest 
that these TCR exhibit a more rigid conformation that confers 
higher affinity. The x-ray crystallographic structures of a germ- 
line antibody of low affinity compared with its affinity-matured 
derivative showed that the high-affinity state was associated with 
stabilization of the antibody in a configuration that accommo- 
dated the hapten (31). Similarly, the NMR solution structure of 
a scTCR that may be analogous to the germ-line antibody 
showed that the CDR3a and jS loops both exhibited significant 
mobility (32). Recent thermodynamic studies of TCR.pMHC 
interactions have also suggested the importance of conforma- 
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Fig 5. Flow cytometric analysis of the binding of scTCR/biotin to cell 
surface peptide/MHC. Peptide-loaded T2-L< cells were incubated with 
btotinylated m6 scTCR (-0.3 ,*M) or T7 scTCR (-1.6 /iM) scTCR followed by 
streptavid.n-PE and analyzed by flow cytometry. (4) Flow cytometry his- 
»£17 L° f , T2 ; L Ce ' IS load6d With QL9 Shaded), p2C a (light shade), or 
MCMV (dark shade) and stained with m6 scTCR/biotin. (B) Mean fluores- 
cent units (MFU) of T2-L< cells loaded with QL9, p2Ca, or MCMV and stained 
with either secondary SA-PE only, T7 scTCR/biotin + SA-PE, or m6 scTCR/ 
biotin + SA-PE. 



tional changes in binding (33, 34). Structural and thermodynamic 
studies of the TCR mutants that we report here should allow us 
to examine if the two CDR3o motifs (Gly- versus Pro-rich) might 
differ in the mechanism by which they confer higher affinity. 

Although the scTCR mutants did not bind the null peptide/L d 
complex MCMV/L", it remained possible that the increase in 
affinity might be accompanied by a change in fine specificity To 
examine this issue, we used QL9 position 5 (Phe) peptide 
variants that have been shown previously to exhibit significant 
differences m their binding affinity for the wild-type 2C TCR 
(35). The binding of these pMHC to various TCR mutants on the 
yeast surface and clone 2C were measured by flow cytometry. As 
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shown in Fig. 3, the native TCR on 2C is capable of binding QL9 
variants that contain either Tyr or His at position five but not 
Glu. Each of the higher-affinity TCR mutants retained their 
ability to recognize the conserved Tyr-substituted peptide and 
they were likewise incapable of recognizing the Glu-substituted 
peptide. However, several of the TCR mutants (m6, m9, and 
ml3) bound to the His-substituted peptide (albeit to different 
extents) whereas other mutants (m5, m7, and ml4) did not bind 
the peptide. Thus, the CDR3a loop can influence the peptide 
fine specificity of recognition but it is not the only region of the 
TCR involved. The effect on peptide specificity could be through 
direct interaction of CDR3a residues with the variant peptide, as 
suggested from earlier studies involving CDR3-directed selec- 
tions (36, 37). Alternatively, binding free energy maybe directed 
at peptide-induced changes in the L d molecule itself. The latter 
possibility is perhaps more likely in the case of the 2C 
TCR:QL9/L d interaction, because position five of QL9 has been 
predicted to point toward the L d groove (35, 38). The fine- 
specificity analysis also shows that it is possible to engineer TCRs 
with increased, or at least altered, specificity for cognate pep- 
tides. Thus, directed evolution of only a short region (CDR3a) 
of a single TCR allows the design of TCR variants with altered 
peptide-binding specificities. 

To determine the magnitude of the affinity increases associated 
with a selected CDR3a mutant, the wild-type T7 scTCR and the m6 
scTCR were expressed as soluble forms in a yeast secretion system. 
Purified scTCR preparations were compared for their ability to 
block the binding of a l25 I-labeled anti-L d Fab fragments to QL9 or 
MCMV loaded onto L d on the surface of T2-L d cells. As expected, 
neither T7 nor m6 scTCR were capable of inhibiting the binding of 
125 I-Fab fragments to T2-L d cells up-regulated with the MCMV 
peptide (data not shown). However, both T7 and m6 were capable 
of inhibiting the binding of anti-L d Fab fragments to QL9/L d (Fig. 
4). The m6 scTCR variant was as effective as unlabeled Fab 
fragments in inhibiting binding, whereas the T7 scTCR was 160-fold 
less effective (average of 140-fold difference among four indepen- 
dent titrations). The K D values of the scTCR for QL9/L d were 
calculated from the inhibition curves to be 1.5 fjM for T7 and 9.0 
nM for m6. The value for T7 is in close agreement with the 3.2 yM 
K D reported for the 2C scTCR (39). These findings show that the 
yeast system, combined with CDR3a-directed mutagenesis, is 
capable of selecting mutants with 100-fold higher intrinsic binding 
affinities for a pMHC ligand. 

If the soluble scTCR has a high affinity for its pMHC ligand, 
then it may be useful, like antibodies, as a specific probe for 
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cell surface-bound antigen. To test this possibility, the soluble 
T7 and m6 scTCR were biotinylated and the labeled scTCR 
were incubated with T2-L d cells loaded with QL9, p2Ca, or 
MCMV. The m6 scTCR, but not the T7 scTCR, yielded easily 
detectable staining of the T2 cells that had been incubated with 
QL9 or p2Ca (Fig. 5 A and B). It is significant that p2Ca-up- 
regulated cells were also readily detected by m6 scTCR, 
because p2Ca is the naturally processed form of the peptide 
recognized by the alloreactive clone 2C and it has an even 
lower affinity than the QL9/L d complex for the 2C TCR (29). 
However, it remains to be determined if the levels of pMHC 
derived from endogenous antigen processing are sufficient to 
allow detection by using soluble TCR as probes. It is reason- 
able to predict that, in some cases, the level will be too low to 
distinguish from background by using standard flow cytometry 
procedures. 

The high-affinity receptors described in our study were de- 
rived by variation at the VJ junction, the same process that 
operates very effectively in vivo through gene rearrangements in 
T cells (2). The fact that we could readily isolate a diverse set of 
high-affinity TCR in vitro indicates that there is not a genetic or 
structural limitation to high-affinity receptors. This supports the 
view that inherently low affinities of TCRs found in vivo are 
caused by a lack of selection for higher affinity and perhaps a 
selection for lower affinity (5-7). In this respect, the higher- 
affinity TCRs now provide the reagents for directly testing 
hypotheses about the effects of affinity on T cell responses (4-7). 
It is interesting to note that similar arguments have been used to 
suggest that the kinetic properties of antibodies may also set an 
in vivo "affinity ceiling," above which there may not be a selective 
advantage to B cells (40). 

In addition to their utility for testing T cell responses, 
high-affinity TCRs can be engineered like antibodies to yield 
high-affinity, antigen-specific probes. Soluble versions of 
the high-affinity receptor can directly detect specific peptide/ 
MHC complexes on cells (Fig. 5). Thus, these engineered 
proteins have potential, for example, as tumor cell diagnostics, 
or on conjugation with cytotoxins, potential agents for cancer 
therapy. 

We thank Michele Kieke for providing 2C scTCR genes cloned in the 
yeast display vector and Gary Durack of the University of Illinois 
Biotechnology Center Flow Cytometry Facility for assistance and advice. 
We also thank Jeff Bluestone and Ian Wilson for comments and 
discussion on the manuscript and results. This work was supported by 
National Institutes of Health Grant ROl GM55767. 
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REAL PARTY IN INTEREST 
The real party in interest is Altor Bioscience Corporation of Miramar, Florida. An 
assignment from the inventors to Dade International was recorded on August 18, 1997 at Reel/ 
Frame 8681/0081. An assignment from Dade International to Sunol Molecular Corporation was 
recorded on February 26, 2003 at Reel/Frame 013787/0276. An assignment from Sunol 
Molecular Corporation to Altor Bioscience Corporation was mailed to the USPTO on February 
3, 2004. 

RELATED APPEALS AND INTERFERENCES 
There are no related appeals or interferences known to Appellants or Appellants' 
representatives that will directly affect or be directly affected by or have a bearing on the Board f s 
decision in the pending Appeal. 

STATUS OF THE CLAIMS 
Claims 1, 2, 4, 7, 8, 14, 67, 69, 71 and 72 stand finally rejected under 35 U.S.C. §103 and 
are pending on appeal. Claims 5, 6, 10-12, 16-20, 61, 65, 70^ 73 and 74 have been cancelled. 
Claims 3, 9, 13, 15, 21-60, 62-64, 66 and 68 were withdrawn from consideration by the 
Examiner. 

STATUS OF THE AMENDMENTS 
Claims 1, 2, 4, 7, 8, 14, 67, 69, 71 and 72 on appeal are set forth in Exhibit A hereto. 
There has been no further amendment to the claims. 

SUMMARY OF THE CLAIMED INVENTION 
The claimed invention features a soluble fusion protein engineered to include a 
bacteriophage coat protein fused to a single-chain T cell receptor ("scTCR"). The single-chain T 
cell receptor was itself designed to include an alpha- variable region ("V-a") fused to a beta- 
variable region ("V-p"). The single-chain T cell receptor forms a pocket that binds antigen when 
the antigen. The claimed soluble fusion protein further includes a beta-constant region ("C-p") 
region that can be fused to V-p, for example. 
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T cells help defend the body against infection. The cells have membrane bound receptors 
that bind foreign antigen with the assistance of a protein complex called "MHC". A key receptor 
is called the T cell receptor ("TCR"). The chemical structure and function of the TCR has been 
extensively studied. For instance, it is known that formation of a TCR-antigen-MHC complex is 
an important step toward fighting infection. 

Appellants discovered that by adding a bacteriophage coat protein to the scTCR, it is 
possible to produce a fully soluble and functional scTCR. Unlike prior scTCRs, the claimed 
fusion proteins were found to be fully soluble, functional, and obtainable in significant quantities 
without difficulty. The claimed fusion proteins have a wide spectrum of important uses as 
described throughout the instant patent application. 

See the Summary Of The Invention at pg. 3, line 22 to pg. 1 1, line 21. See also the 
Background at pg. 1, line 12 to pg. 3, line 19 for related information. None of the art of record in 
this case shows an attempt to make a scTCR that includes a fused bacteriophage coat protein. 

ISSUE 

This appeal presents the issue of whether the Examiner erred in rejecting claims 1, 2, 4, 7, 
8, 14, 67, 69, 71 and 72 under 35 U.S.C. § 103 in view of Chung, S. et al. (1994) Proa Natl. 
Acad. Set (USA) 91: 12654 in view of U.S Pat. No. 5,759,817 to Barbas, Onda, T et al. (1996) 
Mol Immunol 32: 1387; and Huse et al. (1992) J. Immunol 149: 3914. Appellants will refer to 
these citations as "Chung", "Barbas", "Onda" and "Huse", respectively; unless stated otherwise. 
There are no other pending rejections of record in this case. 

GROUPING OF THE CLAIMS 
All of claims 1 , 2, 4, 7, 8, 14, 67, 69, 71 and 72 stand or fall together for the purpose of 
the present appeal. 

CASE HISTORY 
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In consideration of the Examiner's position in this case, Appellants have summarized the 
prosecution history with respect to the 35 U.S.C. §103 rejection at issue. A more detailed 
discussion of Appellants' rebuttal to those arguments will follow under Argument . 



A. A patent application was filed on March 7, 1997 with 59 claims and assigned 
Serial No.: 08/813,781 by the USPTO. 



B. An Office Action was mailed to Appellants on August 25, 2000 by Examiner 
Schwadron. The Examiner stated the following with respect to the §103 rejection on appeal: 

Claims 1, 2, 4, 7, 8, 14, 67, 69, 71, 72 are rejected under 35 U.S.C. 103(a) 
as unpatentable over Chung et al. in view of Barbas US 5,759,817 (filed Jan. 27, 
1992), Onda et al. (Molecular Immunology 32:1387, 1995), and Huse et al. J. 
Immunology 149:3914. 1992 

Chung et al. teaches a single chain T cell receptor which specifically binds 
to peptide ligand (see abstract). Chung et al. further teaches one embodiment of 
human single chain TCR in which C-terminus of V a domain is linked to N- 
terminus of V p chain via a 15 amino acid residue flexible amino acid liner and 
the C-terminus of the V p chain is linked to the beta chain constant domain (see 
Figure 1). In one embodiment the C terminus of V p chain is linked to a alkaline 
phosphatase (PI) protein tag (see page 12655). Chung et also teach that the 
purpose of the linker is to enhance the binding characteristics of the soluble T cell 
receptor and that linkers of about 10 to 30 amino acid residues would be 
considered to be sufficient. Chung et al. teach that the TCR fusion protein can 
bind antigenic protein, thus teaching that the TCR fusion protein comprises an 
antigen binding pocket. Chung et al. teaches a TCR fusion protein comprising V- 
a-peptide linker-V p-C p linked to GPI anchor and expression of such a fusion 
protein in a transfected eukaryotic cell (see results section). Chung et al. disclose 
that the soluble form of TCR protein could be readily obtained by enzymatic 
cleavage with phosphatidylinostol-specific phospholipase C (PI-PLC) (see page 
12656). Chung et al. teaches expression of said TCR fusion protein in a bacterial 
cell system in which the N terminus of the Cp region is linked to a histidine 
protein tag. Chung et al. also disclose a scTCR in which comprises V-a-peptide 
linker-V p-C p GPI in which the Cp component consists of the p chain sequence 
ending right before the last cysteine (the sixth cysteine) (see page 12655). Chung 
et al. further teach that TCR fusion proteins which do not contain the CB do not 
fold into the native conformation. The scTCR disclosed by Chung et al. meet the 
length limitations of the Va. and VP region recited in claims 69 and 71 . Chung et 
al. teach a soluble fusion protein comprising a Va-peptide linker-Vp-Cp 
fragment-protein tag (eg. GPI). Chung et al. does not teach a TCR fusion 
protein further comprising bacteriophage VIII coat protein. 
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However, Barbas discloses a soluble fusion protein comprising a 
bacteriophage coat protein fragment covalently linked to a single-chain 
heterodimeric receptor (see abstract and column 15, lines 27-28, in particular). 
Barbas also discloses that the fusion protein may comprise domains of 
heterodimeric proteins derived from several ligand binding proteins, including 
immunoglobulins and T cell receptors (see column 17, lines 62-66 and column 19, 
lines, 9-28. Barbas discloses that T cell receptor comprises alpha and beta chains 
each having a variable(V) and constant(C) region and T cell receptor has 
similarities in genetic organization and function to immunoglobulins (see column 
19, lines 19-22, in particular). Barbas also teaches that bacteriophage coat protein 
may be derived from cpIII or cpVIII (see column 31, lines 10-28, in particular). 
Barbas discloses that expression vectors expressing soluble fusion proteins in 
which the ligand binding region is fused to bacteria coat protein allows the 
expression of the multiple fusion proteins on the surface of phage particles IE 
approximately 2700 cpVIII heterodimer receptor molecules per phage particle 
(see column 39 line 64 through column 40, line 7, in particular). Barbas further 
discloses that a short length of amino acid sequence at the amino end of a protein 
(IE a protein tag) directs the protein to periplasmic space (see column 8, lines 49- 
55, in particular. One embodiment of the invention is disclosed to be a fusion 
protein comprising in sequence a leader sequence-peptide linker-V region amino 
acid residue-peptide linker-phage coat protein and that in one embodiment, the 
second linker can define a proteolytic cleavage site which allows the 
heterodimeric receptor to be cleaved from the bacteriophage coat protein to which 
it is attached (see column 14, lines 60-65). Thins Barbas discloses but does not 
exemplify a soluble ffunsiom proteinn comprising a bacteriophage coat proteim 
covalemtly linnked to T cell receptor domains. 

Onda et al. disclose a soluble fusion protein comprising a bacteriophage 
coat protein covalently linked to a single-chain T cell receptor by a peptide linker 
sequence wherein the single TCR chain is the alpha chain and the bacteriophage 
coat protein is cpVIII (see abstract and Figure 1, in particular). Onda et al. also 
teach that TCR-bacteriophage coat protein fusion protein can be used to study 
specific binding interactions of the TCR chain to antigenic ligands (see paragraph 
bridging pages 1394-1395, in particular). 

Huse et al. teach that fusion proteins comprising a single chain fusion 
protein comprising Fab fragment of immunoglobulin (which comprises the 
antigen binding pocket of the immunoglobulin molecule) and bacteriophage VIII 
coat protein can be produced and display the fusion protein when expressed in a 
Ml 3 derived vector. Huse et al. further teach that bacteriophage VIII coat protein 



Examiner Schwadron withdrew this statement in the next Office Action dated June 17, 2002 (see below). The 
statement was originally made by a prior Examiner (Lubet) in a related §103 rejection that has been withdrawn. In 
that earlier rejection, Lubet argued that Barbas and Onda do not teach a soluble fusion protein in which a single- 
chain TCR linked to a bacteriophage coat protein. See the Office Action dated June 23, 1998 at pg. 8, part B. In the 
§103 rejection on appeal, Examiner Schwadron argued that Barbas and Onda teach use of TCR -bacteriophage VIII 
fusion protein. That molecule, a heterodimeric T cell receptor fusion protein, is not the claimed invention. 
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fusion protein can recovered from culture medium or from the periplasmic space 
(see abstract). 

Therefore it would have been prima facie obvious to one with ordinary 
skill in the art at the time the invention was made to make a soluble TCR fusion 
protein comprising the Vol. -peptide linker-Vp -Cb fragment-protein taught by 
Chung et al. linked to a bacteriophage VIII coat protein because Barbas et al. 
and Onda et al. teach TCR-bacteriophage VIII coat fusion proteins can be 
wised to study antigen binding properties of such a fusion protein and Huse et 
al. teach that fusion proteins comprising bacteriophage VIII coat protein can be 
produced in bacteria and recovered in relatively large quantities. 

One with skill in the art would be motivated to make such a fusion protein 
to study the antigen binding region of the TCR component or to use the protein to 
elicit anti-idiotypic antibodies. One with skill in the art would be motivated to 
make such a fusion protein in which the Va and Vp region was derived from 
human TCR in order to study human TCR properties or to elicit anti-idiotypic 
antibodies to the TCR component of the protein. 

The preceding grounds of rejection have been maintained since the August 25, 2000 
Office Action despite Appellants' rebuttal argument and claim amendments discussed below. 

C. On October 25, 2000, Appellants' representative met with Examiner Schwadron at 
the USPTO and discussed the art cited. No agreement was reached. 

D. Appellants submitted a response to the rejection set forth in paragraph B, above, on 
February 22, 2001. In that response, Appellants rebutted the prima facie obviousness argument 
by pointing out: 1) that the cited references did not teach or suggest that the membrane "anchor" 
of Chung's single-chain TCR (GPI: a membrane protein) could be substituted with the 
bacteriophage coat protein of Barbas' TCR; 2) that there was no reasonable expectation that the 
substitution (switching Chung's anchor for Barbas' phage coat protein) could be achieved in view 
of substantial differences between scTCRs and TCRs; and 3) that the Examiner's citation of 
Onda was not correct ie., it does not disclose TCR-bacteriophage coat protein fusions, but 
instead, dwelt on smaller constructs having only a V-a chain (but no V-p chain). Onda 
characterized his constructs as having "unusual" binding properties that were not characteristic of 
TCRs. A subset of such constructs were reported by Onda not to work at all. 
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Appellants also discussed the Holler reference: a peer-reviewed scientific article from 
the U.S. Academy of Sciences (PNAS (USA) (2000) 97: 5387 at 5389). Holler provided 
independent and objective evidence of the long-felt need and failure of others in the field to 
make and use the claimed fusion molecules. Specifically, Holler stated that phage display had 
not yet proven successful in the engineering of scTCRs. 



E. In response to the Appellants' arguments in paragraph D above, the Examiner 
issued a Final Office Action dated June 17, 2002. 3 The Examiner maintained the prima facie 
rejection and stated: 

Regarding appellants comments, while heterodiinaeric Molecmiles are a 
preferred embodimemt disclosed in Barbas et al., Barbas et al. disclose: 
"In another embodiment, the present invention contemplates a polypeptide 
comprising an insert domain flanked by an amini-terminal secretion signal 
domain and a carboxy-terminal filamentous phage coat protein membrane anchor 
domain." (column 14, first complete paragraph). 

Barbas et al. further disclose than said construct could include a "receptor 
proteimi" (column 14, second paragraph), indicating that the disclosed method 
could be used for receptors per se (eg. single chain or heterodimeric or single 
chain heteromers). Single chain T cell receptors were known in the art (see Chung 
et al.). 

Regarding appellants comments about the single chain TCR taught by 
Chung et al., 

Chung et al. teach that the GPI anchor is cleaved and the soluble TCR still 
has all the antigen binding properties of the TCR (see pages 12656-12658). Thus, 
the GPI anchor is not required for the soluble TCR to function, it is just used in 
one particular method of making the soluble TCR. Regarding motivation to create 
the claimed invention Cheung et al. discloses that it woinkt be desirable to 
produce their TCR in a phage display system (see page 12658, first column). 
In addition, Barbas et al. teach the advantages of their system for the production 



By "phage display" is meant the process of making a recombinant bacteriophage expressing the scTCR as part of 
the phage protein coat. After infecting bacteria with a recombinant phage engineered to produce the scTCR, the 
protein would be "displayed" on the bacterial cell surface as bacteriophage. The scTCR "displayed" in this manner 
would be amendable to engineering. 

3 The Examiner essentially repeated his rejection of the claims as set forth in the August 25, 2000 rejection. But see 
footnote 1 . With respect to Examiner Schwadron's discussion about Holler, Appellants submitted Weidanz et al. as 
part of a Rule 132 Declaration to address an obviousness rejection that has since been withdrawn. The reference 
provided evidence that a particular scTCR-bacteriophage coat protein vector (pKC44) was capable of forming an 
antigen binding site when expressed. The reference is not prior art to the present application. 
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of peptides. Regarding reasonable expectation of success, both Barbas et al. and 
Chung et al. disclose use of phage display systems to produce single chain 
antibodies (see column 2, third paragraph from bottom and page 12658, first 
column). In addition, the soluble single chain TCR molecules functions with or 
without the GPI linker indicating that the construct itself is functional. 

Regarding appellants comments about Holler et al., said publication was 
published in May 2000. In the amendment filed 6/3/2000, applicant submitted a 
publication by Weidanz et al. (J. Imm. Methods 1998) which discloses the 
claimed invention. Thus, it appears that Holler et ai simply are not familiar 
with the prior art. Thus, the comments of Holler et al. carry no weight 
because two years prior to the Holler et al. publication, Weidanz et al. had 
already published data regarding the production of single chain TCR using 
bacteriophage. Furthermore, Holler et al. discloses a yeast system for producing 
a single chain TCR and it appears that the main focus of Holler et al. is to promote 
their system. 

Regarding appellants comments about Onda et al., the instant rejection 
indicates that "Onda et al. disclose a soluble fusion protein comprising a 
bacteriophage coat protein covalently linked to a single-chain T cell receptor 
by a peptide linker sequence wherein the single TCR chain is the alpha chain and 
the bacteriophage coat protein is cpVIII (see abstract and Figure 1, in particular)". 
The art recognizes that the alpha and beta chains of the TCR generally both are 
involved in antigen binding. The art also recognizes that soluble TCR which bind 
antigen would have a variety of uses. 

F. Appellants filed a Notice of Appeal on October 17, 2002. 

ARGUMENTS 

As an initial mater, Appellants wish to emphasize the substantial differences between 
TCR heterodimers ("TCRs") and single-claim T cell receptors ("sc-TCRs"). 

The TCR is a heterodimer with one a chain and one p chain. 4 Each of these chains 
passes from the exterior of the T cell, through the cell membrane, and into the cell interior 
(cytosol). The a and p chains each include a variable (V-a, V-p) region that cooperate to form an 
antigen binding pocket. The regions are "variable" because its chemical structure can be 



A textbook in the field describes the TCR as "a heterodimer composed of an a and a p polypeptide chain, both of 
which are glycosylated." See Alberts, B et al. (1989) in Molecular Biology of the Cell, 2 nd Ed. Garland Publishing, 
Inc. New York at pg. 1037. By convention, a "heterodimer" such as the TCR properly has two chains (dimer) both 
of which are different (hetero) from the other. 
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changed to make a pocket that fits another antigen. Each of the V-a and V-P regions are 
associated with a constant (C) region. 

TCR heterodimers have been extremely difficult to isolate from T cells. This problem 
has hindered study of the receptor. One approach to address the problem has been to make 
single-chain T cell receptors ("scTCRs"). These synthetic receptors include, on one chain 
instead of two, a fused V-a and V-P region. It has been customary to space the V-a and V-p 
regions from each other with a flexible linker to allow the regions to make an antigen binding 
pocket. Unfortunately, many scTCRs have still proven to be difficult to make and use. 

Appellants point out that "TCR" is understood in the field to mean a hetterodameric T 
cell receptor. The TCR is a membrane bound (insoluble) receptor in which the a and p chains 
cooperate to bind antigen. Reference to a "scTCR" is understood to mean a synthetic simigle- 
chaim molecule that includes the V-a and V-p regions bound together usually through a flexible 
linker. Unlike the TCR, the scTCR binds antigen with only one chain. The TCR and scTCR are 
structurally distinct proteins that are different molecules that bind antigen in different ways. 

I. Summary of the Cited Art 

A. Chung reports functional three-domain single-chain T cell receptors consisting of a 
human Va and Vp region that recognizes a particular antigen (HLA-DR2b/myelin basic protein). 
Chung determined that it was important to fuse a Cp region to the Vp region. Such a three 
domain construct, when linked to a synthetic cell membrane anchor (glycosyl phosphatidyl- 
inositol (GPI) or CD3£ fragment), was found to be expressed and functional. Chung disclosed 
that the cell membrane anchor could be cleaved from the single-chain receptors to obtain soluble 
protein. See the Abstract. 

Chung opined that his single-chain design "may allow" construction of TCR phage 
libraries and that such libraries "may be" tools for studying TCRs. See pg. 12658. However, 
there is no specific disclosure in Chung about how such libraries could be made or, if made, 
whether his single-chain TCRs could tolerate fusion of the bacteriophage coat protein. Chung 
does not report or suggest that the GPI or CD3£ membrane anchor could be substituted with a 
bacteriophage coat protein. Even if there was such a teaching, there is no disclosure in the 
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reference about whether a recombinant bacteriophage could tolerate Chung's scTCR as part of 
the phage coat. 

B. Barbas discloses heterodimeric receptor libraries that use phagmids. In the Abstract, 
phage are taught to encapsulate a genome encoding first and second polypeptides of a receptor 
such as an antibody; in which the first and second polypeptides are integrated into the coat 
matrix of the phage. Barbas generally discloses that such phage may include a polypeptide with 
an "insert domain 11 that has a receptor domain flanked by a secretion signal domain and a phage 
coat protein membrane anchor domain. Col. 14, lines 10-14. Heterodimeric receptors are 
preferred. See Col 3, lines 1-41; Col. 14, lines 15-29; and Col. 15, lines 28-32. According to 
Barbas, there was some uncertainty in the field about which portions of bacteriophage coat 
proteins were needed for phage assembly. Col. 2, lines 19-46. 

Barbas does not teach how to make or use a scTCR with or without a fused bacteriophage 
coat protein. 

C. Onda reports use of a phage display system to explore binding interactions between 
the V-a region and antigen. Onda did not disclose use of the system to study TCR or single- 
chain TCR interactions. In the Abstract, Onda provides at pg. 1387: 

We utilized an Ml 3 phage display system, designed for multivalent 
receptor display, to explore specific binding interactions between various TCR a 
chains and specific antigen in the absence of MHC. 

That is, Onda fused only the V-a region to bacteriophage coat protein. The constructs do not 
include a V-p region and are mot scTCR fusion proteins. Onda's fusions are much smaller TCR "half- 
molecules" lacking the V-p region and antigen binding pocket of Appellants' scTCR. 

Onda at pg. 1395, col. 1, cautioned that his constructs were Mnnsmal and not typical of 

standard TCR interactions: 

Our results extend these findings by demonstrating that the dominant interactions 
of certain TCRa chains for peptide antigens may be sufficiently high that they can be 
analysed independently. However, these interactions are quite nnnnsnal in that they do 
not require the expression of the second TCR subunit or normal MHC and coreceptor 
interactions. These results may raise concern that this model does not reflect typical 
TCR-ligand interactions. 
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Significantly, only some of Onda's Va chain fusion proteins were reported to bind 

antigen when fused to bacteriophage coat protein. At pg. 1395, col. 2 he states that: 

...only a subset of TCR Va have capacity for direct interactions with antigen 
strong enough to be detectable in this system. 

Onda does not teach or suggest fusing a scTCR (V-a and V-p) to a bacteriophage fusion 

protein. 

D. Huse described a phage vector system for screening and producing antibody F(ab) 

fragments. 5 Huse's system was reportedly used to produce free F(ab) and F(ab) displayed on the 

surface of bacteriophage. According to Huse however, not all attempts to produce F(ab) were 

successful. In more than a few instances, the recombinant bacteriophage made to produce the 

fusions apparently would not tolerate certain amounts of antibody protein. In describing 

attempts to display certain antibody H and L chains with his phage vector system, Huse stated on 

pg. 3919, col. 2 that: 

Phage titers of [phage vector] infected cultures were found to decrease 
relative to the level of F(ab)-pVIII fusion protein incorporation (cite omitted). 
Taken together, these results suggest that a functionally viable phage particle- 
may be able to tolerate a limited minimber of incorporated F(ab)-pVIII fusion 
products and that the amomint of F(ab) incorporated into the phage coat may 
adversely affect phage titers and overall F(ab) yield. 

Huse does not disclose fusing a bacteriophage coat protein to a scTCR or TCR. 

II. Summary of the Examiner's Argument 

Grounds for the present rejection under 35 U.S.C §103 were formulated in the Office 
Action dated August 25, 2000. See paragraph B, above. The basis for the rejection has not 
changed substantially in the face of Appellants' arguments and claim amendments. 

According to Appellants' understanding of the Examiner's alleged prima facie case, the 
primary references, Chung and Barbas, are alleged to teach a scTCR linked to a bacteriophage 
fusion protein. Onda and Barbas are relied on to teach that TCR-bacteriophage fusion proteins 
can be used to study antigen binding. Huse is used to teach that fusion proteins with the coat 



F(ab) is an abbreviation for an antigen binding fragment of an antibody ( fragment antigen binding). F(ab) is a 
heterodimer consisting of two different chains ie., the antibody light and heavy chain. F(ab) is readily made by 
cleaving whole antibodies with specific proteolytic enzymes. 
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protein can be made in bacteria. The foundation of the Examiner's position is that because 
Barbas, Onda, and Huse teach some bacteriophage coat protein fusions, then it would be obvious 
to make fusions with Chung's scTCRs. Although facially somewhat plausible, the rejection is 
flawed on both scientific and legal principles as discussed herein. 

III. The Examiner Erred in Rejecting Claims 1. 2, 4. 7, 8, 14, 67, 69, 71 and 72 as Being 
Obvious 

A. Reqimireinnieinits of the prima facie case annd its iraaieteini amice. 

The Examiner erred in maintaining the obviousness rejection in the face of claim 

amendments and the state of the art as submitted made in this case and its parent. The Federal 

Circuit has reiterated that an Examiner's prima facie case is but a procedural tool of patent 

examination, with the express purpose of allocating the burdens of going forward as between the 

Examiner and Applicant. See In re Deckler 977 F.2d at 1449, citations omitted): 

Specifically, when obviousness is at issue, the examiner has the burden of persuasion 
and therefore the initial burden of production. Satisfying the burden of persuasion, 
constitutes a so-called prima facie showing. Once that burden is met, the applicant has 
the burden of production to demonstrate that the examiner's preliminary determination is 
not correct. The examiner, and if later involved, the Board, retain the ultimate burden of 
persuasion on this issue. 

Clearly, as demonstrated herein, adequate evidence of the unobviousness of the claimed 
invention was provided by Appellants to shift the burden of persuasion to the Examiner. 

In view thereof, it is requested that the Board review the obviousness question based on 
the invention as claimed, and the cited references, including all relevant parts thereof. 

B. Standard For Reviewing Am Qbviouismiess Rejection under 35 USC §103. 

The Federal Circuit has reiterated the manner in which obviousness rejections are to be 
reviewed. Where claimed subject matter has been rejected as obvious in view of a combination 
of prior art references, "a proper analysis under section 103 requires, inter alia, consideration of 
two factors: (1) whether the prior art would have suggested to those of ordinary skill in the art 
that they should make the claimed composition or device, or carry out the claimed process; and 
(2) whether the prior art would also have revealed that in so making or carrying out, those of 

11 



ordinary skill would have a reasonable expectation of success." In re Vaeck , 947 F.2d 488, 493, 
20 U.S.P.Q.2d 1438, 1442 (Fed. Cir. 1991), cited In re Dow Chemical Co ., 837 F.2d 469, 473, 5 
U.S.P.Q.2d 1529, 1531 (Fed. Cir. 1988). As the Federal Circuit emphasized by succinctly 
summarizing: "Both the suggestion and the reasonable expectation of success must be founded in 
the prior art, not the Applicants 1 disclosure." Id. See also In re Merck & Co., Inc., 800 F.2d 1091, 
231 USPQ 375 (Fed. Cir. 1986). 

More recently, the Federal Circuit has reviewed the case law regarding 35 U.S.C. §103. 
See InreSang-Su Lee 277 F.3d 1388, 61 U.S.P.Q.2d 1430 (Fed. Cir. 2002). 

Should the Board adopt the Examiner's prima facie case, Appellants submit that the 
claimed invention would not have been obvious in view of the legal standard summarized above. 

C. No Pirimnia Facie Case of Obviousness 

As noted above, the foundation of the Examiner 1 argument rests on the belief that it 
would be obvious to fuse Chung's scTCR to the bacteriophage coat protein of Barbas because, 
allegedly, Barbas and Onda teach TCR-bacteriophage coat fusion proteins and Huse discloses 
that fusion protein with such a coat can be made in bacteria. 

For the Examiner's prima facie case to stand, it is imperative that he establish that: 1) 
The cited references disclose or suggest fusing a bacteriophage coat protein to Chung's scTCR; 
2) there is a settled role for the bacteriophage coat protein in making fusion proteins; and that 3) 
one could fuse Chung's scTCR to Barbas' coat protein with a reasonable expectation of success. 

The Examiner's position is not supported by any of these points. Barbas, as relied on, 
does not teach or suggest a scTCR or even fusion of a scTCR to a bacteriophage coat protein. 
Onda and Huse, when read in their entirety, exemplify uncertainty in the field about using the 
bacteriophage coat protein to make certain fusion proteins. Even Barbas admitted that there was 
some doubt about how much one could change certain bacteriophage coat proteins without 
hindering phage assembly. Moreover, some of Onda's and Huse' molecules did not work well. 
Others did not work at all. On top of that uncertainty is heaped additional doubt about whether 
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Chung's "anchor" fragments could be substituted with the bacteriophage coat protein of Barbas. 
There was also doubt about whether the bacteriophage would tolerate fusion of the scTCR to its 
coat. 

i) Barbas and Onda does not teach or suggest a single-chain TCR (scTCR) 
In the Office Action dated August 25, 2000, Examiner Schwadron took the position that 
Barbas discloses: 

soluble fusion protein comprising a bacteriophage coat protein fragment covalently 

linked to a single-chain heterodimeric receptor (see the abstract and column 15, lines 

27-28, in particular). Barbas also discloses that the ffnsion protein may comprise 

domains of heterodimeric proteins derived from several ligaind bimdiinig proteins, 

including immunoglobulins and T cell receptors (see column 17, lines 62-66 and column 

19, lines, 9-28. Barbas discloses that T cell receptor comprises alpha and beta chains each 

having a variable(V) and constant(C) region and T cell receptor has similarities in genetic 

organization and function to immunoglobulins (see column 19, lines 19-22, in particular). 
* * * * 

Thus Barbas discloses but does mot exemplify a soluble fusion protein comprising a 
bacteriophage coat protein covalently linked to T cell receptor domains 

The heterodimeric receptor proteins pointed out by the Examiner are not scTCRs. 
Heterodimeric proteins, and particularly the TCR of Barbas, are understood in the field to consist 
of two different a and p chains. Brief at pg. 1. Unlike the TCR, the scTCR of Appellants' 
claimed invention is a single-chain molecule with a V-a chain fused to a V-p chain. The position 
that Barbas discloses a "single-chain heterodimeric" receptor simply makes no sense. How can a 
single-chain molecule be a "heterodimer" when that requires two (dimer) different (hetero) 
chains? Brief at pg. 1 and footnote 1. Barbas could not have had the single-chain constructs of 
Onda and Chung in mind. Those references were published well after the priority date of the 
Barbas patent. Thus, nowhere in the reference is there any disclosure about how to make or use 
a scTCR. 

Faced with this rebuttal, the Examiner took the position in the Final Office that Barbas 1 
disclosure of "polypeptides comprising an insert domain" and "receptor proteins" should be read 
to include Chung's scTCRs. Also included in that sweeping reading of Barbas are "single chain 
or heterodimeric or single chain heteromers". That position is without merit. Too much is read 
from Barbas. It does not provide for any scTCR molecules. If the Examiner's overly-broad view 
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of the patent is allowed to stand and sweep in scTCRs, even though Chung's were published well 
after Barbas' priority date, it will preempt any attempt to obtain patent protection for scTCR- 
bacteriophage coat fusion proteins. A principle focus of Barbas was to provide heterodimeric 
receptors linked to a phage coat protein. See the Title of the patent, the Abstract and col. 3, lines 
1-41, for instance. Such receptors are not the fusion proteins Appellants claim and there is no 
suggestion in Barbas to make or use them. 

Even assuming, arguendo, that the Examiner is correct and that Barbas taught or 
suggested a scTCR (years before Chung or Onda were published), one reading Barbas in that 
way would be confused in light of the accepted understanding in the field that a heterodimer such 
as the TCR is a complex of two different polypeptide chains. Brief at pp. 1 - 2 and footnote 1. 

As captioned above, Onda does not disclose a TCR or scTCR fusion to bacteriophage 
coat protein as alleged by the Examiner in the August 25, 2000 and June 17, 2002 Office 
Actions. Instead, Onda reports fusion of TCR a chains to bacteriophage coat protein. The TCR 
a chain is merely a part of the larger scTCR Appellants worked with. That is, the prior 
constructs are significantly smaller (and less likely to cause solubility problems when fused to 
coat proteins) than the scTCR fusions Appellants successfully made. 

Moreover, the Examiner ignored Onda's clear hesitation about reading too much from 
TCR a chain constructs that include a fused bacteriophage coat protein. According to Onda, the 
interactions of the constructs were unusual and not typical of TCR-ligand Interactions. See 
above and Onda at pg. 1395, col. 1. 

Importantly, only some of Onda ! s TCR a chain constructs even worked to bind antigen. 
See above and pg. 1395 of Onda at col. 2, second full paragraph. 

According to Onda then, some TCR a chain-bacteriophage coat protein fusions work and 
some do not. Those that do work were viewed as "unusual" and "not typical". In view of this 
caution, one working in this field would not be encouraged to fixse a bacteriophage coat protein 
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to a scTCR. None of the other cited references shed any light on Onda's clear hesitation to 
extend there findings to other TCR molecules. 

The Examiner thus erred in trying to formulate a prima facie case by not giving due 

weight to all relevant portions of Onda. Contrary to this practice, it is well established that the 

Examiner must consider all relevant portions of cited references, including those portions which 

substantially weaken her position. In particular, the former CCPA stated in In re Mercier 5 1 5 

F.2dll61, 185 USPQ at 778: 

The relevant portions of a reference include not only those teachings which would 
suggest particular aspects of an invention to one having ordinary skill in the art, 
but also those teachings which would lead such a person away from the claimed 
invention. 

See also Phillips Petroleum Co. v. U.S. Steel Corp ., 673 F.Supp. 1278, 1315, 6 USPQ2d 
1065, 1093 (D.Del. 1987), affd, 865 F.2d 1247, 9 USPQ2d 1461 (Fed. Cir. 1989). 

The Board is thus urged to take Onda in its entirety and to consider all relevant portions 
of it including the passages quoted above. Read in this way, as it should, the reference would 
lead one in this field to doubt whether it would be feasible to fuse a bacteriophage coat protein to 
a scTCR to produce a soluble and functional fusion protein. 

ii) Huse reported difficulties producing some bacteriophage coat protein fusions 

The Huse reference, as quoted above, reported that not all F(ab)-p VIII (bacteriophage) 
coat proteins could be made at high titre. That is, Huse stated that the bacteriophage may not 
tolerate some amounts of F(ab) constructs, thereby decreasing phage titres and overall F(ab) 
yield. See above and Huse at pg. 3919, col. 2. When Huse is read in its entirety, as it should, 
the Examiner's statement that "Huse et al. teach that fusion proteins comprising bacteriophage 
VIII coat protein can be produced in bacteria" is an unsupported generalization. Huse clearly 
found that some amounts of heterodimeric F(ab) constructs harmed the bacteriophage that 
carried them. In view of this warning, a worker in the field would have good reason to doubt 
whether a bacteriophage could be fused to a scTCR or even a heterodimer such as a TCR without 
considerable experimentation. 
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The Examiner took the position in the Final Office Action that Barbas and Chung provide 

a reasonable expectation that one could make the claimed fusion proteins: 

Regarding reasonable expectation of success, both Barbas et al. and Chung et 
al. disclose use of phage display systems to produce single chain antibodies (see 
column 2, third paragraph from bottom and page 12658, first column). 

However as clearly illustrated by Huse, not all phage display systems using 
antibodies work as expected. Some amounts of heterodimeric F(ab) antibodies cause 
problems. Thus the Examiner's position is not supported by the art of record in this case. 

The Board is thus requested to take Huse in its entirety and to consider all relevant 
portions of it including the passage quoted above. In re Merrier 515 F.2d 1 161, 185 USPQ at 
778; and Phillips Petroleum Co. v. U.S. Steel Corp ., 673 F.Supp. 1278, 1315, 6 USPQ2d 1065, 
1093 (D.Del. 1987), affd, 865 F.2d 1247, 9 USPQ2d 1461 (Fed. Cir. 1989). 

The substantial uncertainties raised by Onda and Huse have not been addressed by the 
Examiner. No objective scientific work has been made of record to resolve or explain them. 
Read in their entirety, as they should, Onda and Huse point out problems about making and using 
some bacteriophage coat protein fusions. Even if one skilled in this field were to read Onda and 
Huse selectively and disregard their warnings, there is still nothing in the art relied on to suggest 
that one could make or use a scTCR fusion to a bacteriophage coat protein. Even Barbas 
admitted that there was uncertainty about what coat protein parts could be manipulated for phage 
assembly. 

In marked contrast, Appellants have demonstrated that it is possible to fuse a 
bacteriophage coat protein to the scTCR and obtain fully soluble and functional fusion protein. 
See Appellants' patent specification at Example 1 (showing construction of soluble scTCR fusion 
proteins); Examples 2-3 (production of special vectors to make the scTCR fusion proteins); 
Example s 4-5 (expression of Appellants' soluble scTCR fusions); Example 6 (purification of 
the soluble scTCR fusion proteins); Examples 7-1 1 and 16 (characterization of particular scTCR 
fusion proteins); and Example 15 (analysis of a bacteriophage library expressing Appellants 1 
scTCR). 
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Accordingly, the §103 rejection fails both prongs of the Federal Circuit test for 
determining obviousness. See In re Vaeck , supra; and In re Dow Chemical Co ., supra. 
It is submitted that the Board reverse the obviousness rejection in light of this test. 

Whether or not the Examiner is taking the position that it would be obvious to try 
to make the claimed scTCR fusion proteins, both the Board and Federal Circuit have 
made it quite clear that this is not a burden that Appellants must bear. In particular, the 
Court in In re OTarrell 7 USPQ 2d 1673 (1988) held at page 1681: 

The admonition that "obvious to try" is not the standard under §103 has been 
directed mainly at two kinds of error, hi some cases, what would have been 
"obvious to try" would have been to vary all parameters or try each of numerous 
possible choices until one possibly arrived at a successful result, where the prior 
art gave either no indication of which parameters were critical or no direction as 
to which of many possible choices is likely to be successful. 

See also Ex parte Old , 229 USPQ 196, 200 (1985). 

While the Court in In re OTarrell went on to state that while obviousness does not 
require absolute predictability of success, what is required under §103 is a reasonable 
expectation of success. 

Thus whether those in the field may have been tempted to fuse a bacteriophage 
coat protein to a scTCR, the field would have been cautioned from doing so in view of 
the warnings of Onda, Huse and to some extent even Barbas. The Barbas patent does not 
disclose or suggest any scTCR fusion to the coat protein. Chung does nothing to remedy 
these defects. In short, the field's unsuccessful experience with some amounts of 
antibody heterodimers (Huse) and some single-chain constructs (Onda), provides at worst 
no basis for believing that fusion of a bacteriophage coat protein to an scTCR will work 
and at best, a reason to doubt that such a fusion will result in a fully soluble and 
functional protein. 
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iii) No teaching or suggestion that Chung's "anchor" fragment could be 
substituted with Barbas* bacteriophage coat protein 

Chung reported TCRs linked to a cell membrane anchor (glycosysl phosphatidylinositol 
(GPI) or murine CD3 £ chain). The anchor apparently helps to express the single-chain TCRs. 
The anchor molecules are entirely different from the coat proteins of Barbas both in terms of 
chemical structure and function. For example, Chung's anchors are hydrophobic cell membrane 
proteins while those of Barbas are relatively more hydrophilic bacteriophage coat components. 
Chung's anchor apparently plants the scTCR in the membrane while the coat envelops the phage. 

The Examiner has pointed to no teaching or suggestion in the cited art that Chung's 
anchor molecules could be substituted with Barbas' bacteriophage coat proteins. The 
obviousness rejection falls far short of establishing any nexus between Chung's anchors, which 
are attached to his scTCRs, and the coat proteins reported by Barbas. 

iv) Objective Evidence of Non-obviousness 

In addition to the lack of a prima facie case of obviousness, the strong objective 
evidence of non-obviousness presented during prosecution of this case further compels 
allowance of the claims. 

Evidence of such objective indicia of non-obviousness, the so-called "secondary 
considerations" must be considered in all obviousness determinations. Stratoflex, Inc. v. 
Aeroquip Corp.. 713 F.2d 1530, 1538-1539 (1983): 

Indeed, evidence of secondary consideration may often be the most probative and 
cogent evidence in the record. It may often establish that an invention appearing 
to have been obvious in light of the prior art was not. It is to be considered as part 
of all the evidence, not just when the decision-maker remains in doubt after 
reviewing the art. 

See also Graham v. John Deere , 383 U.S. 1, 148 USPQ 459 (1966). 
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This standard set forth by the Federal Circuit applies not only during litigation of 
issued patents, but to a determination of patentability during ex parte prosecution as well. 
In re Sernaken 702 F.2d. 989 217 USPQ 1, 7 (Fed. Cir. 1983). However, in the instant 
case, the Examiner is not properly considered evidence of "long felt need and failure of 
others" in maintaining the present § 103 rejection. 

Specifically, Appellants' provided the Holler reference as indicating that the field 
longed to make the claimed fusion proteins but could not. Holler reported that phage 
display had not yet proven successful in making scTCRs despite what he saw as 
extensive structural similarity between antibodies and TCR V regions. Appellants' 
invention addressed this need and succeeded by providing soluble fusion molecules with 
a bacteriophage coat protein linked to the scTCR. The Holler reference is highly 
probative of the difficulties the field had in making these molecules and should be given 
substantial weight by Examiner Schwadron. MPEP 716.01(b). 

The Examiner completely dismissed the Holler reference on grounds that "Holler 
et aL simply not familiar with tine prior art". See Part D, above. That is no basis for 
disregarding the Holler's statement that the field wanted but failed to produce the claimed 
invention. Appellants are under no burden to provide evidence of Holler's knowledge of 
the art in order to have the reference considered as objective indicia of non-obviousness. 
See Stratoflex, Inc. v. Aeroquip Corp. , 713 F.2d 1530, 1538-1539 (1983); In re Sernaker, 
702 F.2d. 989 217 USPQ 1, 7 (Fed. Cir. 1983); and MPEP 716.01(b). 

Moreover, Examiner Schwadron's citation of Weidanz et al. to support his 
disregard of Holler is clearly improper. That reference is not prior art and cannot serve as 
a basis for ignoring Holler or substantiating the obviousness rejection on appeal. 

It is requested that the Board consider Holler as objective evidence that workers in 
field wanted, but could not make, the claimed invention. 

CONCLUSIONS 
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For the Examiner's prima facie case to stand, he has the burden of showing that: 
1) The cited references disclose or suggest fusing a bacteriophage coat protein to a 
scTCR; 2) there is a settled role for the bacteriophage coat protein in making fusion 
proteins; and that 3) one could fuse Chung's scTCR to Barbas* coat protein with a 
reasonable expectation of success. These points have not been made by the Examiner. 
As discussed above, Barbas does not disclose scTCRs. Moreover, there was significant 
uncertainty in the field about whether it was possible to fuse a bacteriophage coat protein 
to a scTCR as exemplified by Huse and Onda. In view of the cited art and in 
consideration of the Examiner's position, it could be argued that one might be motivated 
to test fusing the bacteriophage coat protein to Chung's scTCR in the hope of producing a 
soluble and function protein. But this is not the legal standard required by our case law. 
It is without a doubt not obvious from the art of record to make the claimed invention of a 
scTCR fused to a bacteriophage coat protein. 

Appellants submit that they have overcome the Examiner's obviousness rejection 
in the view of all the facts and argument of record in this case. Simply put, one of skill in 
this area would not be able to predict, with any reasonable expectation of success, how to 
make and use the claimed invention. 

Importantly, Appellants have provided experimental evidence clearly showing that it is 
indeed possible to make and use scTCR-bacteriophage coat protein fusions. See Examples 1-11, 
15 and 16 as discussed above. 

In summary, Appellants submit that the instant invention is both novel and unobvious. 
The arguments set forth above establish that non-obviousness. 
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Although it is not believed that the present submission requires any fee for 
consideration by the Office, the Examiner is authorized to charge such fee to our deposit 
account 04-1105 should such fee be deemed necessary. 



Date: A^T'^ J O^f 



Respectfully submitted, 




Robert L. Buchanan (Reg. No. 40,927) 

EDWARDS & ANGELL, LLP 

P. O. Box 55874 

Boston, MA 02205 

Tel. (61 7) 439-4444 

Fax (617) 439-4170/ 7748 

Customer No.: 21874 
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EXHIBIT A 
Claims 1, 2, 4, 7, 8, 14, 67, 69, 71 and 72 



on appeal 



What is claimed is: 

1 . A soluble fusion protein comprising a bacteriophage coat protein covalently linked to 
a single-chain T cell receptor comprising an antigen binding pocket, wherein the single-chain T 
cell receptor comprises a V-a region covalently linked to a V-p region by a peptide linker 
sequence that effectively positions the V-a region and the V-p region to form the antigen 
binding pocket, the soluble fusion protein further comprising a C-p region fragment. 

2. The soluble fusion protein of claim 1, wherein the C-terminus of the V-a region is 
covalently linked by the peptide linker sequence to the N-terminus of V-P region. 

4. The soluble fusion protein of claim 2 wherein the C-p region fragment is covalently 
linked between the C-terminus of the V-p region and the N-terminus of the bacteriophage coat 
protein. 

7. The soluble fusion protein of claim 2, wherein the peptide linker sequence 
contains from approximately 2 to 20 amino acids. 

8. The soluble fusion protein of claim 1, wherein the bacteriophage coat protein is 
gene III or gene VIII protein. 

14. A soluble fusion protein comprising covalently linked in sequence: 1) a V-a 
region, 2) a peptide linker sequence, 3) a V-p region covalently linked to a C-p region fragment, 
and 4) a bacteriophage gene VIII protein, wherein the peptide linker sequence effectively 
positions the V-a region and the V-p region to form an antigen binding pocket. 

67. The soluble fusion protein of claim 1, wherein the C-terminus of the V-P region 
is covalently linked to the N-terminus of a C-p region fragment. 
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69. The soluble fusion protein of claim 1, wherein the V-cc region and the V-p region 
are about 200 to 400 amino acids in length. 

71. The soluble fusion protein of claim 1, wherein the C*P region fragment is about 
50 to 126 amino acids in length. 

72. The soluble fusion protein of claim 70, wherein the C-p region fragment does not 
include a cysteine residue corresponding to position 127 of a full-length C-p region. 
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APPLICATION OF A FILAMENTOUS PHAGE pVID FUSION PROTEIN SYSTEM 
SUITABLE FOR EFFICIENT PRODUCTION, SCREENING, AND MUTAGENESIS 

OF F(ab) ANTIBODY FRAGMENTS 

WILLIAM D. HUSE, 1 * TIMOTHY J. STINCHCOMBE.* SCOTT M. GLASER.' LISA STARR/ 
MICHAEL MacLEAN/ KARL E. HELLSTR6M/ INGEGERD HELLSTROM/ and DALE E. YELTON* 
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We describe the application of a novel filamentous 
phage vector system suitable for efficient screening 
and production of F(ab) antibody fragments. The 
vector system can concurrently produce free F(ab) 
fragments and F(ab) displayed op the surface of M13 
bacteriophage via a V„C M l-pVIII fusion protein. 
When expressed in a supO (nonsuppressor) strain of 
Escherichia coli free F(ab) can be produced. Anti- 
body F(ab) fragments are secreted into culture me- 
dium at concentrations up to 0.3 mg/liter and con- 
veniently subjected to detailed analysis with little 
ar no purification. Higher concentrations of F(ab) 
[approximately 10 mg/liter) were found to accumu- 
late in the periplasmic space. In this report the 
irector system is shown to produce correctly folded 
auid assembled F(ab) fragments of chimeric L6, a 
mAb against a tumor-associated Ag expressed by 
many human carcinomas. 

Until recently mAb have been primarily produced In 
mammalian cells. The slow growth rates and difficulty 
n genetically manipulating antibody genes expressed In 
nanunalian cells have motivated development of meth- 
xls to express antibody genes In simpler organisms. Mo- 
ecular cloning techniques in bacteria have facilitated the 
production and manipulation of antibody fragments, in- 
creasingly aiding the search for useful antibodies (1.2). 
Methods have been reported for the expression of anti- 
body fragments in Escherichia coll using plasmids (3- 
T), bacteriophage X (8-11). and more recently the flla- 
nentous phage Ml 3 (12-17). Very large combinatorial 
ibraries of 10 5 to 10 s distinct antibody specif lcitiea can 
>e created In microorganisms, far greater than can be 
ichleved with hybridoma cell fusion methods (9. 10). 
iowever, the rapid Identification, isolation, and. If nec- 
essary, modification of antibodies with the goal of lm- 
) roving affinity or redirecting specificity for Ag depends 
leavlly on the availability of powerful screening methods 
>oth in terms of sampling large numbers of antibody 
ragments and evaluating certain aspects of antibody 
>lndlng. Hence, an antibody expression vector that per- 
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mits rapid cloning and mutagenesis of antibody V region 
genes and produces sufficient levels of antibody for bio- 
chemical analysis would be highly desirable. 

Because antibody fragments can be displayed oh the 
surface of filamentous phage, bacteriophage Ml 3 vectors 
are proving particularly valuable in creating and screen- 
ing sequence libraries for antibody fragments of interest. 
Briefly, fusion proteins are created by inserting DNA 
encoding an antibody fragment in front of a phage coat 
protein gene (18, 19). The fusion proteins become an- 
chored in the phage coat via the coat protein and antibody 
sequence Is displayed at the phage surface. Phage-bear- 
ing antibody sequences of Interest can be detected by Ag 
binding and Isolated in Infectious form (13, 14). In addi- 
tion to antibodies the Incorporation of malarial protein 
(20), growth hormone (21), and a hexapeptlde library (22) 
Into the coat proteins of filamentous phage has been 
reported. 

We have developed an M13 filamentous phage vector 
system that can produce and display F(ab) as a fusion 
product to pVm coat protein and can also synthesize free 
F(ab) in quantity. We report here the production of chi- 
meric L6 antibody F(ab) fragments In these M13-derlved 
vectors. L6 is a mAb against a tumor-associated cell 
surface Ag expressed by many human carcinomas (23). 
L6 has been shown capable of lyslng cancer cells In vitro 
(24) and is currently the subject of clinical trials (25). 

MATERIALS AND METHODS 

Construction of bacteriophage M13IXL604 for expression of L6 
Flab). Restriction enzymes, calf Intestine alkaline phosphatase, T4 
polynucleotide kinase. T4 DNA polymerase, and T4 DNA Ugase were 
purchased from Boehrlnger-Mannheim (Indianapolis. IN). Total RNA 
was Isolated from the chimeric L6^ secreting cell line described by 
Fell et al. (26) by the guanidlntum thlocyanate-phcnol/chloroform 
method (27), First strand cDNA was synthesized using ollgo dT and 
BRL Superscript reverse transcriptase (GIBCO BRL. Grand Island. 
NY) and PCR' ampUf IcaUon of L6 H chain (Vh~Cm 1 ) and L chain (V L - 
Cm) sequences was performed by the method of SaikJ et al. (28) as 
modified by Sastry et al. (29). The following primers were used for 
PCR amplification. The restriction endonuclease recognizing the 
boldface and underlined cloning site In the sequences Is Indicated 
within the parentheses: forward V„ primer (Xhol): 5'-CAGTCTGGA- 
OCTGA GCTCGAQA AGCCTGCAGAG-3^: forward V L primer [Ncoi) 

1 Abbreviations used In this paper: PCR. polymerase chain reaction: 
gVin. gene Vffl coding for the major coat protein of M13 phage: pVm. 
major coat protein of Mid phage; C*l H chain C region 1 V u L chain V 
region: IPTG. lsopropyl-^^tlilo^lsctopyranoslde: V M . H chain V region; 
MOPS. 3^N-morpbx>llno)propanc sulfonic add: TES. 30 mM Trts-HCl. 2 
mM EOT A. 2% sucrose (w/v). pH 8.0: supE. strains of E. coli that carry a 
glutamlne Inserting amber (UAG) suppressor tRNA: MES. 20 mM 3-{N- 
morphollno)propane sulfonic add. 2 mM EDTA. 20% sucrose (w/v). pH 
7.5; supO, strains of E. colt that do not carry a suppressor tRNA. 
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5 ' <kxx:aaccac^atoocccaaa 

reverse C«l prtodTSp^VrGTOTG^ 

GGGCTCTGC-3'; reverse Ot primer (Xbal): 5 -COGCTTAACTCT; 
AGACTAACACTXriXXXXrTOTTCAAGCT-3'. The Vh-C„1 and V l -0 
chains were digested and llgated Into prepared Ml 31X31 and 
M13DC12 vectors (30) resulting In M 1 3DC3 1 L6-H and Ml 3DC12L6-L. 
respectively (Fig. 1). Sequence corrections of the cloned L6 VVCnI 
and V L -C* chains were accomplished by slte-<ilrected mutagenesis 
of uracil-substituted ssDNA as described (31. 32). The correcting 
nucleotides are Indicated by the underlined sequences. Five N-ter- 
minal V H amino acids were Inadvertently omitted from the original 
L6 sequence information resulting In an 1 1 -amino acid de letion upon 
cloning. One V H correction primer was 5 ' <rTXnXXAGGCI iCTTC- 
AGCTCAf^TCC AGA GGCTTTTG(C/T)CAC-3' . This primer cor- 
rected the Xhoi cloning site back to the original IG sequence and 
replaced V„ codons 7 to 10. The C/T mixed site introduced an amino 
add change In the leader sequence that was found to Increase 
expression. The following V M primer served to restore the remaining 
six N-terminal amino adds 5 ' <^GCTCAGGTXX^G ACTCCACC AAC- 
TG6ATCTG< kKXATXXXnXK}TTGGGC"3'. The foUowlng V t primer 
served to correct an Incorrect nucleoti de in the reverse C* PCR 
primer: 5 ACTCTCOCXTGTTGAAGCTCTTT^ . The H chain 

encoding M13IX31L6-H and L chain encoding Ml 3IX12L6-L vectors 
were combined by annealing as described (30) to form M131XL604 
(Fig. 1). M131XL605 was derived from M131XL604 by mutation of 
the TAG (stop) codon, located between the H chain encoding se- 
quence and the pseudo wild-type gene VttJ. to GGT (glycine) using 
the oligonucleotide 5 ' -CGCCTTC AGC ACCGG ATCC ACT AGT-3' so 
that continual Vh-C* 1 -p VH! fusion protein would be made. DNA 
sequence analysis of ssDNA prepared from phage Isolates was per- 
formed with Sequenase Version 2 according to the manufacturer 
(United States Biochemical, Cleveland. OH). 

Anttbodies and reagents. The anil-Id mAb to L6 have been 
previously described (33). Antl-ld 1 Is a 7 2b isotype. anil-Id 3 is a 
7 2a Isotype. and both anti-Id 7 and antl-ld 13 are 7 1 laotypes. 
Alkaline phosphataAc -conjugated antibodies were purchased from 
Fisher Biotech (San Francisco. CA). Unconjugated goat anti-human 
e antibody was purchased from Caltag Laboratories {So. San Fran- 
cisco. CA). Rabbit antt-M13 IgG was purified by Sepharose-proteln 
A chromatography and exhaustively absorbed against whole E. colt 
The anli-M13 antibody was subsequently biotlnylated using o-bk>- 
tlnoyl-c-amlnocaprolc acid N-hydroxyaucclnlmldc ester (Boehringcr- 
Mannhelm) using standard chemistries. V ectasia In avidln-horse- 
radish peroxidase complex (Vector Laboratories. Buiilngame. CA) 
and streptavldln-alkallne phosphatase complex (Boehrtnger-Mann- 
hclm) were used for second step reactions. 

Screening by replicate JUter lifts. M 1 3IXL604 phage were plated 
at a low plaque density. F(ab) expression was Induced by overlaying 
the plate with a 0.45-m nitrocellulose filter (Schleicher and Schuell, 
Keene. NH) soaked In 10 mM 1PTG and Incubating at room temper- 
ature from 6 h to overnight. The filter was removed and placed In 
blocking buffer (Bloslte Diagnostics. San Diego. CA) to block nonspe- 
cific binding sites. Phage growth was resumed by incubating the 
plate for an additional 2 h at 37*C and a second filter appUed as 
described above. This procedure was repeated for the last filter and 
all filters were then blocked in blocking buffer. All monoclonal or 
polyclonal antibodies to be used for screening were diluted In block- 
ing buffer. Filters were probed with either alkaline phosphatase- 
conjugated goat antibody to human X -chain, alkaline phosphatase- 
conjugated goat antibody to human .-chain, or antl-ld 3. which binds 
to L6 antibody. In the case of antl-M 3. a secondary alkaline phoa- 
phatasc-conjugated goat antibody to mouse Igr^a was used for de- 
tection. All filters were then washed three times for 10 mln with 25 
mM Trts, 0.137 M NaCl. 5 mM KCI. 0.9 mM CaCl,. 0.5 mM MgCl,. 
and 0.05% Tween 20 (pH 7.4) and developed with alkaline phospha- 
tase substrate reagent (Bio-Rad. Richmond, CA). 

FJab) production and purification. For analytic scale production 
of F(ab) the supE amber suppressor strain XL-1 (Stratagene. San 
Diego. CA) and the supO nonsuppressor strain MK30-3 (Boehringer- 
Mannheim) were each grown In 2X YT medium at 37*C until the 
cultures reached a density of 0.4 to 0.6 at OCW Each strain was 
then diluted 1/10 into three culture tubes containing 3 ml 2X YT 
and infected with 3 >U of high titer (10 11 plaque-forming units/ml) 
phage stock of M13DCL604. M13DCL605. or M13IX31/tube and In- 
cubated with shaking for 3 h at 37*C. Protein synthesis was Induced 
by the addition of D?TG to a final concentration of 1 mM and shaking 
allowed to proceed for 10 to 14 h at ambient temperature. Pcri- 
plasmic fractions were prepared essentially as described by Skerra 
and Pluckthorn (5). The Infected cultures were centrtfuged for 10 



* M 1 31X31 and Ml 31X1 2 vector* are available at no charge from IxsyK 
San Diego, CA. 
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Figure 1. Construction of M 1 31XL604 vector for expression of L6 Fab. 
The M13DCL604 vector product contains one copy of pseudo wild-type 
gVlfl (gVHP~) and one wild-type gvm (gVDJ) both downstream from the H 
chain encoding region, gvm^ has been altered in sequence to reduce 
homologous recombination with gvm contained on the same vector. The 
presence of gVffl reduces the likelihood of selection against certain fusion 
proteins as the result of compromised phage viability (34). 

mln at 4*C and the cleared culture supernatant was reserved. The 
bacterial pellet was resuspended In 60 ml cold TES and digested on 
Ice for 10 mln after adding an equal volume of a cold freshly prepared 
solution containing 2 mg/m! lysozyrne in TES. The periplaamic space 
containing free F[ab) was fractionated by centrif ligation at 9000 rpm 
for 10 mln at 4*C The soluble periplaamic fraction was retained and 
diluted in blocking buffer to adjust the concentration of Ftab) to be 
equivalent to that found in the cleared culture supernatant 

For purification of L6 F(&b) a 1 -liter culture of MK3C-3 was grown 
and Infected with M131XL604 as described above. The cells were 
harvested by centrif ugatlon at 5800 x g for 10 mln at 4*C. The pellet 
was rcsuspended in 20 ml MES at ambient temperature. An equal 
volume of 2 mg/ml rysoryme In MES was added with mild vortcxlng 
and the suspension Incubated at ambient temperature for 10 mln. 
The soluble periplasmlc fraction was Isolated by centrif ugatlon at 
9700 x g for 10 mln at 4*C- The periplasmlc fraction was subjected 
to a second centrif ugatlon at 12.000 x g for 30 mln at 4*C. The 
cleared periplasmlc fraction was loaded onto a macioprep 50 S 
support (Bio-Rad) washed extensively with 20 mM MOPS, pH 7.5 
and ehited with 20 mM MOPS. 120 mM Nad. pH 7.5. The partially 
purified L6 F|ab) was concentrated by centrif ugatlon in a Centrtcon 
30 device (Anucon. Beverly. MA) and size fractionated on a Btogel P- 
60 (Bio-Rad) column equilibrated and ehited with 20 mM MOPS. 1 20 
mM NaCl. pH 7.5. Eluted L8 F\ab) fractions were assayed for binding 
to antl-ld 3 by EUSA. Fractions ehitlng at 8. 9. and 10 ml were then 
pooled and concentrated. Samples from each purification step were 
analyzed by SDS-PAGE on a 10% nonredudng gel followed by stain- 
ing with Coomassle brilliant blue. 
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EUSA characterization of L6 Ffabf produced by M13IXL604. 
U3TXL604 phage stocks were prepared for assay by overnight 
ifectton of XL-1 In LB broth and 10 jg/ml tetracycline to maintain 
xprcsak>n of F'. and 1 mM IPTG. The bacteria were removed by 
entrlf ugatlon (10 min In a mlcrofugej and the phage solution was 
ssayed for binding to L6-speclflc anti-Id 1 . 3. 7. 13. and to an anti- 
-chaln antibody by EUSA. All antibodies were coated onto Immulon 
! mlcrotlter plates (Dynatech Laboratories. Chantllly, VA) at 1 0 *<g/ 
il In 0. 1 M NaHCOa, pH 8.5, overnight at 4*C. The antibody solution 
ras shaken out and the plates were blocked with 300 /J of specimen 
locking buffer (Genetic Systems. Seattle. WA) for I h at room 
rmperature. Plates were washed before use with 0.5% Tween 20 in 
.15 M NaCl and with the same solution between incubations. Phage 
llutlons were dispensed Into mlcrotlter plates In 100 ^1 of blocking 
uffer and incubated overnight. After washing. 100 >J of conjugate 
locking buffer (Genetic Systems) containing 1 jig/ml of biotlnylated 
abb« antibodies to M 1 3 were Incubated 1 h at ambient temperature. 
Tter washing. 100 pi of Vedastaln avidln-HRP complex were In- 
u bated 30 mln at room temperature.' After a final wash step. 100 
I of chromogenlc substrate (3.3\3.5'-tetramethyJbenzidine) in a 
Itrate/phosphate buffer were added. The reaction was stopped with 

00 of 3 N H^S0 4 at various times ranging from 10 min to 2 h to 
chleye an optimum signal to background ratio for each different 
late-coating antibody. Plates were read on a mlcroplate reader 
Blotek. Burlington. VT) In dual channel mode at 450/B3O nm. All 
ssay points were measured In duplicate. Ml 31X31 served as a 
egative control. 

The expression of M13IXL604. M131XL605. and. Ml 31X31 in 
actertal strains MK30-3 and XL-1 was also analyzed by EL1SA. 
ulture supernatant and diluted periplasm Ic f ractlons prepared from 
lfected XL-1 and MK30-3 bacteria described above were serially 
i luted In diluent (Bloslte Diagnostics), added to microliter plates 
oated with anti-Id 3 and Incubated for 2 h at ambient temperature. 
6 F(ab) expressed In the culture medium or in the perlplasmic apace 
'as detected with an alkaline phosphatase-conjugated goat antibody 
> human * -chains. Expression of L6 F(ab) displayed on the phage 
j rf ace was detected by Incubating the captured sample first with 
ie biotlnylated rabbit antibody to M13 and then with streptavldtn- 
tkallne phosphatase complex. The washed plates were developed 
1th 6 mg/ml phenol phthaletn monophosphate In 0. 1 M amlnome- 
lylpropanedlol, 0.5 M Trts. and 0. 1 % NaN,. pH 10.2 (JBL Scientific, 
an Luis Obispo. CA) for 10 to 30 mln. The reaction was stopped by 
ddition of one-third volume of cold 30 mM Tris base. 6 mM EDTA. 
nd the abeorbance at 560 nm measured. 

RESULTS 

Construction of L6 Ffab) expression vector 
tl31XL604. The construction of M13IXL604 is shown 

1 Figure 1. Total RNA was Isolated from the chimeric 
6-sccretlng cell line and cDNA synthesized. After PCR 
mpllflcatlon of the cDNA using sequence- specific 
rimers, the chimeric L6 V L -Gr L chain was digested to 
ompletion with Ncol and Xbal and cloned Into the Ncol/ 
Ixxl site of M131X12 to construct M13IX12L6-L. Slmi- 
trly, the chimeric L6 V m -Ch1 H chain PCR product was 
lgested to completion with Xhol arid Spel and cloned 
lto the Xho\/Spe\ site of M13DC31 to construct 
I13IX31L6-H. Ml 31X31 contains an amber stop codon 
>cated directly 5' of a modified gVlII gene (pseudo-gVUI) 
ncoding mature Ml 3 pVTTi major coat protein. Cloning 
nUbody V H or V„-Cm1 regions into the Xhol/Spel site of 
[131X31 abuts these antibody sequences In frame with 
le amber stop-pseudo gVIH sequence. Thus. 
(131X31L6-H should produce L6 V^ChI-pVuT fusion 
roduct when expressed in an E. colt amber suppressor 
Jain such as XL-1 and produce predominantly free L6 
h-ChI protein when expressed In a nonsuppressor 
:rain such as MK30-3. The H and L chain vectors were 
ien recombined Into the single M131XL604 expression 
hage by annealing Hindlll/T 4 DNA polymerase-digested 
I13IX12L6-L to M/uI/T4 DNA polymerase-digested 
1 1 3EX3 1 L6-H through the homologous regions found be- 
iretn the H tndm and MIuI restriction sites contained In 
3th vectors (30). Correct L6 H and L chain sequences 



were confirmed by DNA sequence analysis. The vector 
Ml 31X1*605, which contains a GGT codon coding for 
glycine in place of the amber stop found in M13IXL604, 
was constructed by site-directed mutagenesis of 
M13IXL604. Thus, M13IXL605 should produce L6 V„- 
Ch 1 -pVTII fusion product irrespective of the bacterial host 
used for expression. 

Functional characterization and purification of 
M13IXL604 expres$ed L6 Ffabh M13DCL604 phage ex- 
pressing L6 Ffab) was Initially characterized by replicate 
filter lift assays (Fig. 2). XL-1 bacteria were Infected at a 
low multiplicity of Infection resulting in low plaque den- 
sity of phage. No positive plaques were detected by filter 
lift assay when probed with alkaline phosphatase-con- 
Jugated goat antibody to human XL chain (Fig. 2A). In 
contrast, numerous, superimposable. uniform positive 
.plaques were detected .in assays of human * -chain (Fig. 
2B] and anti-Id 3, which recognizes assembled L6 H and 
L chain (Fig. 2C). Thus, screening by replicate filter lifts 
allows for detection of functional F(ab) expressed in this 
system. 

To further evaluate the functional integrity of L6 F(ab) 
expressed by M131XL604. phage displaying L6 F(ab) on 
the phage surface were assayed for binding to a panel of 
distinct mouse monoclonal anti-Id raised against the 
mouse mAb L6. EUSA experiments shown in Figure 3 
demonstrated that four anti-Id antibodies specific for L6 




Figure X Screening for L6 f^fab) reactivity by replicate filter lifts. 
M13IXL604 phage wu rcplated at low density for screening. Replicate 
filter lifts were prepared and probed wtth A) alkaline phosphataac-con- 
Jugatcd goat antibody to human X-chain. B) alkaline phosphatase-conju- 
Rated goat antibody to human t-chaln. and C) antl-kJ 3 antibody. The 
filter in C was washed and Incubated with alkaline phoophatasc-conju- 
gated goat antibody to mouse Ig72a. Filters were then washed and devel- 
oped with alkaline phosphatase substrate reagent. 
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are recognized by the fusion protein displayed on the 
M13IXL604 phage. In the ELISA format used, 
M13IXL604 phage were grown In the suppressor bacte- 
rial strain XL-1. and phage displaying incorporated L6 
F(ab)-pVni protein are captured by solid phase anti-Id 
antibodies and detected with anti-M13 antibodies. Thus, 
only phage-bound L6 F{ab) is detected. The four anti-Id 
antibodies exhibit different binding specif icities to L6 
(33). Anti-Id 1 and 7 bind to L6 L chain, 13 to H chain 
and 3 to assembled L and H chains. The binding of 
M13IXL604 phage to an anti-*-chain antibody further 
confirmed the assembly of L6 L chain with L6 H chain- 
pVIIl fusion protein to yield F(ab), because L6 L chains 
are of the * type (Fig. 3). Ml 31X31. which contains no H 
chain-encoding region served as a negative control for 
each anti-Id. A representative curve shows no Ml 31X31 
binding. The ELISA results in Figure 3 thus indicate the 
presence of both L6 L and H chains and confirm that 
proper folding and assembly of L and H chains on the 
phage surface have been achieved. 

Purification of L6 F(ab) fragments produced by 
M131XL604 phage grown in the nonsuppressor bacterial 
strain MK30-3 was accomplished by loading the crude 
periplasmtc fraction prepared from a 1 -liter shake flask 
culture onto a cation exchange resin. The eluted material 
was concentrated and size fractionated by gel exclusion 
chromatography. Figure 4A shows the elution profile 
from the gel exclusion column as assayed by ELISA for 
functional binding to anti-Id 3 using anti-human « to 
detect captured L6 F(ab). The purified F(ab) was shown 
to have a M r of 42.7 kDa by SDS-PAGE. a value consistent 
with the 48-kDa size of known F(ab) fragments (Fig. 4B). 
Based therefore on both functional and physical proper- 
ties, the identity of the purified material is chimeric L6 
F(ab). 

Regulated expression of M13IXL604 production of 
F(ab)and phage displayed Flab). The proper activity of 
M13IXL604-expressed L6 F(ab) having been established, 
the functioning of the expression control system of 
M13IXL604 was tested. This involved comparing levels 
of fusion F(ab) with free F(ab) produced when phage were 
grown in suppressor strains [supE) vs nonsuppressor 




Dilution' 1 

Figure 3. ELISA characterization of L6 F{ab) produced by M 1 3IXL604. 
Various dilutions of phage expressing L6 F(ab) on the phage surface were 
evaluated for binding to anti-Id 1. 3, 7. and 13 and to an anti-human <- 
chain antibody. Bound phage were then detected as described in Mate- 
rials and Methods. Curves represent the average of triplicate assays. 
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Figure 4. Purification of L6 F(ab) expressed by M 1 31XL604- Infected 
MK30-3 bacteria. A) elution profile of L6 F\&b) fractions assayed for 
binding to anti-Id 3 antibody by CUSA. Fractions eluting at 8. 9. and 10 
ml were then pooled and concentrated. B) SOS-PAGE analysis of purified 
Fab. Samples from each purification step were analyzed by SOS-PAGE 
on a 10% nonredudng gel followed by staining with Coomassle brilliant 
blue. The L6 chimeric F(ab) standard was prepared from whole IgG by 
papain digestion followed by protein A chromatography (Pierce. Rockford. 
IL). 

strains [supO) of bacteria, respectively. This comparison 
was made using ELISA assays employing anti-Id 3 to 
detect assembled L6 F(ab). F(ab) displayed on the surface 
of M 1 3 phage was detected using the U 1 3 assay described 
in Figure 2. M13IXL605. a variant construct of 
M131XL604 that has the amber stop codon replaced with 
a glycine encoding codon and therefore always synthe- 
sizes Vh-Ch 1 -pVm protein, served as a control. 

When phage were grown in the supE strain XL- 1 , both 
M13IXL604 and M13IXL605 secreted phage displaying 
F(ab) into the culture medium (Fig. 5 A) and, as expected, 
little or no mature phage displaying F(ab) was detected in 
the pertplasmic space (Fig. SB). In the supO strain 
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?tgurt5. Characterization of the 
3TXL604 expression control system, 
uld cultures of the amber suppres- 
straln XL-1 (supE) and the non- 
presaor strain MK30-3 [supO) were 
h ln/ected with either M131XL604. 
MXL605. or M13DC31 and culture su- 
natant and pcrlplaaxnfc fractions pre- 
ed as described In Materials and 
hods. Culture supernatant and pert- 
tm from Infected cultures were as- 
ed for functional binding to anti-Id 3 
^ab) displayed on phage surface [A to 
ind by total F(»b) (£ to H). 
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30-3. the amber stop codon In M13IXL604 should 
vent fusion protein formation. Figure 5C shows phage 
playing F(ab) to be sharply reduced In M131XL604 vs 
3DCL605. The low signal seen In the M13DCL604 cul- 
e medium Is presumably caused by nonspecific asso- 
tion of L6 F(ab) with phage particles and/or transla- 
lal readth rough. Again no mature phage displaying 
b) was found in the periplasm (Fig. 5D). In examining 
production of F(ab) by the anti-Id 3/«-chaln assay 
h M13IXL604 and M131XL605 secreted antl-*-reac- 
: material into the culture medium when expressed in 
icr XL-1 or MK30-3 (Figs, 5E and 5G). Because the 
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efficiency of amber suppression can be widely variable, 
we expect that M131XL604 expressed in XL-1 should 
secrete free F(ab) In addition to F(ab) displayed on the 
surface of phage. Thus, the signal In Figure SE should 
represent some combination of the two molecules. Poly- 
ethylene glycol precipitation of both M13IXL604 and 
M13IXL605 phage from Induced XL-1 culture supcrna- 
tants showed that approximately 95% of the anti-x-reac- 
tive material remained in the polyethylene glycol-cleared 
supernatant (data not shown) indicating that the predom- 
inant secreted product is free L6 F(ab). Consistent with 
the view that M13IXL604 and M13KL605 secrete free 
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L6 F(ab), abundant amounts of soluble assembled F(ab) 
were detected In the peiiplasmlc space In both strains of 
E. coll (Fig. 5. Fand H). MK30-3 appeared to consistently 
produce higher concentrations of F(ab) than XL-1 . Quan- 
titative ELISA demonstrated that the concentration of 
free F(ab) produced by M13LXL604 grown in the supO 
strain MK30-3 approaches values up to 0.3 mg/llter In 
culture supernatant and concentrations approaching 10 
mg/liter within the periplasmic space (data not shown). 

DISCUSSION 

M 1 31X31 (for H chains) and Ml 31X1 2 (for L chains) are 
the Ml 3 vectors that can receive polyclonal sets of V 
regions for constructing combinatorial libraries or Indi- 
vidual antibody genes for V region production and/or 
mutagenesis (30). The system Is the first to allow differ- 
entially controllable production of free and fusion F(ab). 
Upon recomblnlng the two vectors, all of the control 
elements required for regulated expression of phage dis- 
played F(ab) or free F(ab) are contained in a single expres- 
sion vector. For phage display F(ab) is synthesized as a 
fusion product to pVm coat protein, the major structural 
protein of the filamentous Ml 3 phage particle. In the 
Ml 3 bacteriophage the major coat protein pVTfl is ex- 
pressed in several thousand copies per phage particle. 
Although this level of expression may be useful for pro- 
ducing F(ab) as F(ab)-pVIII fusion proteins, pVIII synthe- 
sized solely as a fusion product Is not likely to form a 
properly assembled coat as a result of steric hindrance 
by the much larger H chain polypeptide. We reasoned 
that If wild-type pVUl. In addition to F(ab)-pVm fusion 
product, is available for assembly then the formation of 
a mature infectious phage particle could occur. With this 
in mind, the H chain-gVIII expression vector Ml 31X31 
was constructed to contain two copies of gVTJI. The copy 
that anchors F(ab) to the surface of M13 has had gVIH 
codons extensively altered (pseudo gVIH) to prevent re- 
combination with the wild-type gVm. 

To demonstrate the utility of these vectors we cloned 
the antibody V L -C*- and V H -C M 1 -encoding sequences f rom 
the chimeric L6 transfectoma cell line Into the Ml 31X1 2 
and Ml 31X31 vectors, respectively. The two vectors were 
recombined to create M13IXL604. M131XL604 expresses 
a diclstronlc message encoding both H and L chain L6 
sequences under transcriptional control of an inducible 
Lac promoter. An amber stop codon resides between the 
H chain-encoding region and the pseudo gVUI coat pro- 
tein^ When grown in an amber suppressor strain {supE) 
of E. colt. M13IXL604 was shown to produce L6 H chain- 
pVlII fusion protein resulting in phage-dlsplayed F(ab) in 
addition to free F(ab). When free F(ab) was desired as the 
major end product, the phage was grown In a nonsu re- 
pressor strain [supO). The M13IXL605 vector, which 
solely produces F(ab)-pViIl fusion protein, was con- 
structed to evaluate, by comparison, the M 131XL604 con- 
trol system. 

M13IXL605 was found to display slightly higher levels 
of phage-assoclated F(ab) than M13DCL604 when grown 
In the supE bacterial strain. This Is likely attributed to a 
higher proportion of F(ab)-pVIII fusion protein relative to 
wild-type pVTU produced in the M13IXL605- Infected 
strain. In both the supE and supO bacterial strains 
M13IXL604 secreted chimeric L6 F(ab) at levels some- 
what higher than those of M13IXL605. Phage titers of 
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M13IX31-. M13DCL604-. and M13KL605 -infected cul- 
tures were found to decrease relative to the level of F(ab)- 
pVm fusion protein incorporation (D. Yelton, unpub- 
lished observations). Taken together, these results sug- 
gest that a functionally viable phage particle may be able 
to tolerate a limited number of Incorporated F(ab)-pVllI 
fusion products and that the amount of F(ab) incorpo- 
rated Into the phage coat may Inversely affect phage titers 
and overall F(ab) yield. Furthermore, the appearance of 
free F(ab) in the culture media suggests that H chain- 
pVm fusion protein unincorporated into phage is properly 
assembled with L chain and secreted as functional F(ab). 

The set of anti-Id antibodies raised against the murine 
mAb L6 demonstrated that faithful expression and func- 
tion of bacterially produced chimeric L6 F(ab) occurs in 
the M 1 3IX 1 2 and M 1 31X3 1 vector system. L6 F(ab) activ- 
ity is readily detectable by either the nitrocellulose filter 
lift or ELISA formats. The anti-Id antibodies used in these 
studies serve as a convenient and Informative mode) for 
antibody-Ag binding. Inasmuch as the tumor Ag bound 
by L6 in vivo is yet to be purified or fully characterized. 

The M13IX31 and M131X12 vector system described 
here can serve as a versatile, general purpose approach 
to F(ab) production and screening. F{ab) production in 
this system is sufficiently robust to permit multiple rep- 
licate filter lifts, a practical requirement for Implement- 
ing various screening strategies. The vectors can be used 
to create combinatorial antibody libraries to identify 
novel antibodies. Screening of sufficiently large combi- 
natorial antibody libraries could potentially allow useful 
antibody fragments of murine or human origin to be 
isolated without the necessity to perform standard im- 
munization procedures (35). In addition, because ollgo- 
nucleotlde-directed mutagenesis Is convenient and highly 
efficient In Ml 3 these vectors are ideal for engineering 
antibodies with new properties. It Is conceivable that the 
ability to create stable, high affinity human antibodies is 
a promising endeavor. 

In a companion publication (36). we show that the 
Ml 31X31 and M13DC12 antibody expression system 
lends itself to efficient antibody engineering by site-di- 
rected mutagenesis. Codon-based mutagenesis of L6 hy- 
pervariable regions proved effective in altering the fine 
specificity of L6 in a predefined manner. Thus the system 
allows mutagenesis, screening, and mutant F(ab) produc- 
tion to be accomplished very rapidly. 
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A PHAGE DISPLAY SYSTEM FOR DETECTION OF T CELL 
RECEPTOR-ANTIGEN INTERACTIONS 

TAKEBUMI ONDA/tt DRAKE LAFACE,*t§ GOTTFRIED BAIER,*|| 
THOMAS BRUNNER,* NAKAYUKI HONMA,H TOSHIFUMI MIKAYAMA,1f 

AMNON ALTMAN* and DOUGLAS R. GREEN* 
•Division of Cellular Immunology, La Jolla Institute for Allergyand Immunology, 1 1 149 N. Torrcy 
Pines Rd, La JoUa ? CA 92037, U.S.A.; IjKirin Brewery Co. Ltd. Pharmaceutical Research Laboratory, 

Macbashi Shi Gunma; Japan 

■{First received 19 June 1995; accepted in revised form 30 June 1995) 

Abstract— The process of T cell recognition involves a complex set of interactions between the various, 
components of the TCR/MHC. peptide trimolecular complex. We have developed a system for 
exploring the specific binding interactions contributed by the constituent subunits of TCR complexes 
for components of their ligands. We utilized an M13 phage display system, designed for multivalent 
receptor display, to explore specific binding interactions between various TCRa chains and specific 
antigen in the absence of MHC The multivalent TCR-phage display system was sensitive enough to 
reveal some TCRa chains capable of binding directly to antigen with the same fine specificity shown 
by the MHC-restricted T cells from which the a chains were derived. Cross-specificity analysis using 
two antigen-binding TCRat chains derived from T cells with different polypeptide antigen specificities 
confirmed the fidelity of this binding. In mixtures of antigen-binding and non-binding TCRa-dis- 
playing phage, specific selection was achieved at a starting frequency of 1/1000, suggesting that this 
system can be employed for selection and analysis of TCR-displaying phage libraries. While the 
binding specificities exhibited by these TCRs are unusual, they provide a novel perspective from 
which to study the specific binding interactions thai constitute TCR antigen binding. 

Key words: MI3 filamentous phage, peptide binding, phage-display, TCR. 



INTRODUCTION 

Specific recognition by T lymphocytes is mediated by T 
cell antigen receptors (TCR) (Jorgensen et a/., 1992b; 
Chien and Davis, 1993). The most extensively studied 
TCR-hgand interactions involve TCRa/? heterodimers 
binding to peptide-MHC complexes. A number of stud- 
ies have focused on defining the intricate biochemical 
interactions between the various components of the TCR 
and MHC-peptide complexes. As a consequence of these 
investigations, it has been suggested that residues within 
the CDR3 regions of TCRa and fi chains interact with 
specific residues from the antigenic peptide fragment 
(Davis and Bjorkman, 1988; Engel and Hedrick, 1988; 
Danska et aL; 1990; Jorgensen et aL, 1992a). More recent 
studies have indicated that TCR interactions with peptide 
appear to be very specific, while interactions with MHC 
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are comparatively degenerate (Ehrich et aL, 1993). The 
TCR-MHC interactions could be established in a variety 
of configurations with the same TCR and MHC and 
appear to be influenced by the interaction of the TCR 
with the peptide. Thus, while TCR interactions with resi- 
dues from both peptide (Engel and Hedrick, 1988; Dan- 
ska et aL, 1990; Jorgensen et aJ., 1992a) and MHC 
(Ajitkumar et a!., 1988; Peccoud et aL, 1990) appear to 
be essential for efficient T cell recognition, the dominant 
interactions are seemingly mediated by CDR3 residues 
associated with specific peptide (Ehrich et aL, 1993). 

Although most studies have focused on conventional 
TCR binding of MHC-peptide ligands, a number of 
alternative TCR-hgand interactions have emerged, pro- 
viding the basis for further studies in TCR specificity. 
Examples include TCR j$ chain-binding to superantigens 
(White et aL, 1989; Gascoigne and Ames, 1991; White 
et aL, 1993); TCRs which specifically bind peptide-free 
mycolic acid antigens in the context of CDI restricting 
elements (Porcelli et aL, 1992); and TCRs with speci- 
ficities for carbohydrate moieties of post-translationaily 
modified peptide antigens (Haurum et a/., 1994; 
Michafilsson et aL, 1994). These recent studies of non 
conventional TCR-ligand interactions lend credence to 
earlier studies of T cells which expressed afi TCRs capable 
of binding directly to hap ten-conjugates (Rao et aL. 
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19&4a, 1984b; Siliciano et aL, 1985) or peptide antigens 
(Esch and Thomas, 1990) in the absence of MHC 
proteins. Interestingly, direct TCR-binding interactions 
with hapten or peptide iigands, in the absence of MHC, 
demonstrated relatively high affinity constants (KqS = 5 
x 10~ J -6 x I0" 6 ) when compared to the normal range 
of TCR-ligand affinities (Rao et aL, 1984b; Siliciano et 
aL, 1986; Esch and Thomas, 1990; Davis and Chien, 
1993). 

Studies of TCR that directly bind nitrophcnol haptens 
suggested that the specificity was mediated by the TCRa 
subunit in a manner independent of TCR0 (Kuchroo et 
aL, 1991). Similar studies by our group and others have 
suggested the possibility that a subset of TCRa molecules 
may bind directly to antigen for which the T ceH encoding 
the TCR protein is specific (Bissonnette et aL, 1991; 
Green etaL. 1991; Mori et aL, |993). While these unusual 
TCR-ligand interactions may represent a relatively small 
subset of TCR specificities, they are nonetheless 
intriguing because they provide an alternative approach 
with which to explore the many complex interactions that 
mediate specific binding. Since the constituent a and P 
chains of TCR may bind different residues of the peptide 
ligand component in a relatively autonomous manner 
(Jorgensen et aL, 1992a, 1992b), we reasoned that it 
should be feasible to study directly the specific inter- 
actions of TCRa chains with antigenic peptide. An 
elemental analysis of the specific interactions that con- 
stitute the complex binding of TCR with MHC-pcptide 
ligands would be extremely useful. While these studies 
would not refute existing theories of conventional TCR- 
antigen binding, they would provide a unique perspective 
from which to study the individual specific binding inter- 
actions between the various subunits of the TCR- 
MHC/peptide complex which contribute to the overall 
TCR binding specificity. 

We employed the filamentous phage display system for 
the study of immunological receptors (Kang et aL, 1991; 
Barbas and Lerner, 1991b) to directly assess specific bind- 
ing interactions of several TCRa polypeptides with anti- 
genic peptide and globular proteins in the absence of 
MHC. The multivalent phage display system, utilized to 
enhance avidity, was optimally suited to study inter- 
actions of TCR molecules directly with their peptide 
ligand, as the binding affinities are relatively low com- 
pared to immunoglobulin receptors. 

EXPERIMENTAL PROCEDURES 

Reagents, strains, vectors 

The Escherichia colt XLl-Blue (teO and VCSM13 
helper phage (kan r ) were purchased from Stratagene. AH 
enzymes were purchased from Boehringer Mannheim 
and Promega. The phagemid vector pComW (Kang et 
aL, 1991) utilized to produce clones encoding TCRa 
chains fused to the cpVIII of M13 filamentous phage 
was generously provided by Dcnise Burton (The Scripps 
Research Institute). All molecular biology procedures 
were performed according to conventional techniques as 



described in Molecular Cloning: a Laboratory Manual 
(Sambrook et aL, 1989) unless indicated otherwise. 

Peptides and antibodies 

Poly 1 8 peptides were kindly provided by Dr Bhagirath 
Singh (University of Western Ontario, Canada). Bee 
venom phospholipase A 2 and bovine phospholipase A 2 
were purchased from Sigma. Cell lines producing hamster 
anti-mouse TCRa (H28-710.16) and anti-TCR/? (H57- 
597 hamster IgG) were kindly provided by Dr Ralph 
Kubo (Cytel, La Jolla, CA, U.S.A.). Supernatants were 
concentrated by precipitation with 45% saturated 
ammonium chloride followed by dialysis, and the anti- 
bodies were purified by protein A chromatography. 

The TCRa cDNA and clones 

Full-length cDNA sequences encoding the A 1 . 1 TCRa 
(Green et aL, 1991). 5C.C7 TCRa (Fink et aL, 1986) 
(provided by S. Hedrick, University of California, San 
Diego, CA, U.S.A.) and the 3B3TCRa (Mori et aL, 1993) 
(T. Mikayama, unpublished sequence), respectively, were 
used as templates in the polymerase chain reaction (PCR) 
subcloning procedures. Fifty nanograms of cDN A tem- 
plate was mixed with 60 pmol of each primer (Table 1), 
200 mM dNTP, Promega Taq polymerase buffer con- 
taining 1 .5 mM MgCl 2 and five units of Taq polymerase 
in a final volume of 100 ml. Amplification was carried out 
for 20-30 cycles on a TwinBlock thermal cycler (Ericomp 
Inc., San Diego, CA t U.S.A.) under the following con- 
ditions: 1 min of denaturation at 94 n C, 2 min of primer 
annealing at 50°C, 1 min of elongation at 72°C, followed 
by a final elongation at 72°C for 15 min. Amplified prod- 
ucts were hydrolysed (4 hr at 37°C) at the Xhol and Xbal 
sites (25 U cnzyme//ig PCR fragment) encoded by the 
primers, size-fractionated by agarose gel electrophoresis 
and purified using a Geneclean procedure (Bio 101 Inc., 
La Jolla, CA, U.S.A.). The PCR primers were designed 
to produce TCRa cDNA sequences that could be ligated 
into the pCombS vector in frame with the pclB leader 
sequence at the 5' end and fused \o the N-terminus of the 
cpVIII protein on the 3' end. The purified - Xbal 
insert was directionally cloned into the phagemid vector 
pComb8 at the Xhol and Spei sites and transformants 
were screened with internal oligonucleotide probes in 
colony lifts. Plasmid DNA from positive clones were 
sequenced by the dideoxy method using Scquenase 2.0 
(USB Corp., Cleveland, OH, U.S.A.), analysed using 
the MacVector analysis program (IBI, New Haven, CT, 
U.S.A.) confirming identity, orientation and in-frame 
cloning of TCRa cDNAs. 

Recombinant phage preparation 

Production of recombinant filamentous phage dis- 
playing TCRa chains was carried out essentially as pre- 
viously described for the production of phage displaying 
immunoglobulin receptors (Kang et aL, 1991; Barbas and 
Lerner, 1991b). Briefly, £. coli XLl-Blue (Stratagene) 
cells were transformed with recombinant TCRa/pComb8 
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Variable region primers 

TCR Vat (sense) 
5CC7 

TCR V* (sense) 
3B3 

TCR Va (sense) 



Constant region primers 

TCR-VJC 

TCR Cx (antisense) 

TCR-VJ 

Truncated Cor (antisense) 



AI 



5-OAAGAGCECGAQATGAAATCC TGAGT-3' 

5'-TGGACAdCGAQATGCAGAGGAAC( 

5'-GCXKXGCIXX5AGATGAAATCCTTGAGTX}TTTTACTA 

GTGGTCCTGTGGCTCCAGTTAAACTGCGTGAGGAGC- 

CAGCAGCAAGTGCAGCAGAGTCCTGCA-3' 

5M3CTGTOO^£LACk:CACCGCCACCGTC 
5'^GGAGlCIAGAGCCA(XGCCAC(XrrcGAC^ 



Restriction sites used 



phagemid and selected on LB plates containing 100 /ig/ml 
ampicillin. Fresh ampicillin-resistant colonies were 
grown in liquid culture on a shaker at 37°C in super- 
broth (SB) for 1-2 hr in the presence of ampicillin (50 
/ig/ml) to select for cells bearing phagemid and tetra- 
cycline (10 /ig/ml) to induce the F episome. The cultures 
were then incubated with I raM isopropyl 0-r>thio- 
galactoside (EPTG) for 1 hr to induce production of the 
TCR-cpVlII fusion protein and then superinfected with 
VCSM13 helpeij phage (10 12 pfu) (Stratagene) to give a 
final phage/cell ratio of 10-20:1. The cells were shaken 
(37°C) for an additional 2 hr and then selected for 
helper phage induced antibiotic resistance (kanamycin, 
70 jig/ml) and grown overnight, 37"C, 250 rpm. Phage 
supernatant was cleared by centrifugation of the cultures 
(4000 rpm in a GSA rotor, Sorvall, at 4°Q. The phage 
were precipitated by adding 3% (w/v) NaCl and 4% (w/v) 
polyethylene glycol 8000 for 1 hr at 4'C and centrifuged 
at 9000 rpm in a Sorvall GSA rotor at 4°C. The phage 
pellets were resuspended in PBS to 1/25 of the original 
volume and aggregated phage and debris were removed 
by centrifugation for 5 min in a benchtop micro- 
centrifuge. Phage supernatants were transferred to fresh 
tubes and stored in aliquots at— 20 3 C. 

Affinity selection panning 

The panning procedure utilized to screen binding of 
TCR displayed on the surface of filamentous phage was 
a modification of the original protocol described by 
Parmky and Smith (1988). Wells of a microti tration 
ELJSA plate (Immulon 2. Dynatech) were coated with 
peptide or phospholipase Aj (Sigma) in sodium bicar- 
bonate (0.1 M, pH 9.5) at 4°C overnight in a volume of 
50 pi at the concentrations indicated in the figure legends. 
Antibodies were coated onto plates using Tris (50 mM, 
pH 9) at 4°C overnight in a volume of 50 pi at the 
concentrations indicated in the figure legends. The wells 
were washed twice with PBS (Dulbccco's, pH 7.4) and 
blocked by completely filling the well with 3% bovine 
scrum albumin (BSA) (Sigma, fraction V) and incubated 
for 1 hr at 37°C. The blocking solution was flicked out, 



rinsed with PBS and 50 /il recombinant phage was added 
to each well (typically 10 lo -l0" CFU unless otherwise 
indicated) and incubated for 2 hr at 37°C. The phage 
were then removed and the wells were washed 10 x (pip- 
etting up and down to wash) with TBS/Tween (50 mM 
Tris-HCl, pH 7.5, 150 mM NaCl, 0.05% Tween 20) over 
a period of 1 hr allowing the wells to remain filled between 
washings. Washing was done carefully to ensure simi- 
larity between samples, experiments and individuals. 
Adherent phages were eluted with 50 /il elution buffer 
(0.1 M HQ adjusted to Ph 2.2 with solid glycine, 1 mg/ml 
BSA) for 10 min at room temperature and neutralized 
with 6 ml of 1 M Tris base. Eluted phage were used to 
infect fresh £. coii XL I -Blue cells, ODmo = I (grown in 
the presence of 10 /ig/ml tetracycline to induce F' episome 
expression) for 15 min at 37°C followed by selection 
on LB/ampicillin plates to assess the number of bound 
recombinant phage per ml of eluted phage. 

Immunoblot of TCRa-pVHI fusion proteins 

Phagemid DNA of TCR-pComb8 or control con- 
structs were transformed in XLl-Blue cells and trans- 
formants were inoculated in 25 ml 2xTY medium 
containing 50 tig/ml ampicillin. Protein expression was 
induced with 0.1 mM IPTG at an OD«o = 1 for 7 hr at 
26°C. Cells were harvested by centrifugation. sonicated 
on ice for I min in PBS, 1 mM PMSF, 1% NP-40 and 
the resulting lysate was centrifuged (13,000g, 4 C Q to 
remove insoluble debris. The soluble cellular proteins 
were analyzed by SDS^PAGE and immunoblotted with 
an anti-TCRot antibody (H28-7 10.1 6). 

Presentation assay 

The TCR specificity of A 1.1 hybridoma cells for our 
poly 18- related peptide analogues was assessed by mea- 
suring the 1L-2 responses of Al.i cells to peptide-pulsed 
BALB/c (I-A d ) spleen cells (y-irradiated 2000 rad). Super- 
natants were collected following overnight peptide stimu- 
lation and the relative production of IL-2 was assessed 
in a CTLL assay. Starved CTLL cells were washed and 
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incubated in various dilutions of sample or control super- 
natants (100 pi) in 96-well plates (Costar) at 37°C for 
16 hr. The celb were then pulsed with [ 3 H}-thymidine 
(1 //Ci/well) and harvested an additional 4-6 hr later. The 
IL~2Kiependent proliferation was assessed by the relative 
level of [ 3 HJ- thymidine incorporation (cpm). 

Hybridization of replica plates 

Following two rounds of affinity selection panning 
(described above), ampicillin-resistant CFUs were blot- 
ted onto replica filters as described (Sambrook el aL, 
1989). The replica filters were denatured (0.5 N NaOH, 
1.5 M NaCl) and neutralized (1.5 M Nad, 0.5 M Tris- 
HCL (pH 7.4)1 and then fixed (80°C, 2 hr). .The filters 
were then hybridized with a 5' ^-labelled A 1 . 1 TGR Va 
specific probe (5'GAAGAGCTCGAGATGAAATC- 
CTTGAGT-3). 



A 



Cce TM 



VJCa 



V, 



X ^~L./ 

V V 



RESULTS 

The TCR phage display 

We previously presented evidence suggesting that the 
TCRa molecule from the Al.l T cell hybridoma (Va 1.2, 
JaTA65) may bind directly to antigenic peptides for 
which this T cell is specific (Bissonnette et al. f 1991; Green 
et al. t 1991). The Ml 3 filamentous phage display system 
(Kang et aL, 1991; Barbas et aL y 1991a; Barbas and 
Lerner, 1991b) provided the means to direcdy assess the 
ability of A 1.1 TCRa polypeptides to specifically bind 
to peptide antigens. Wc utilized the phagemid vector, 
pCombS (Kang et al. f 1991) to generate recombinant 
phage-displaying TCR proteins fused to the N-terminus 
of the coat protein VIII (cpVIII). The cpVIII molecules 
form the capsid coat during phage assembly (utilizing 
about 2500 cpVIII proteins per phage particle) allowing 
for multiple receptors to be displayed on the surface of 
the recombinant phage (Felici et aL. 1991; Greenwood et 
al. 9 1991; Kang et aJ. t 1991). This multivalent expression 
can greatly enhance the avidity, which facilitates the 
study of binding interactions with moderate affinity. 

The PCR-generated fragments encoding the VI- or 
VJC-domains of the A 1 . 1 TCRa molecule were subcloned 
into the pCombS phagemid vector to generate recom- 
binant phage displaying multiple copies of the A 1.1 
TCRa molecule on the surface. This is illustrated in Fig. 
1 (see Experimental procedures). Extracts of cells trans- 
formed with various recombinant or control phagemids 
were subjected to SDS-PAGE and immunoblotted with 
an anti-TCR Ca specific antibody (H28.710.16) to con- 
firm that TCRa/cpVIII fusion proteins were produced. 
As shown in Fig. 2A, a protein that migrated with the 
predicted molecular weight for the TCR-VJC/cpVIII 
fusion protein, Mr 36.5 kDa, was detected from extracts 
of cells transformed with the Al.l TCR-VJGx construct. 
A second protein of M r 34.5 kDa was also detected, which 
probably represents a partially degraded form of the 
fusion protein, since it was not observed in the other 
lanes. No bands were observed from extracts of cells 
transformed with either the parental phagemid (pComb8 > 
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Fig. 1. Schematic protocol for the generation of chimeric con- 
structs. (A) Full-length cDNA sequences encoding various 
TCRa chains were used as templates in the polymerase chain 
reaction (PCR) subcloning procedures. The 5' PCR primers 
(arrows) contained an Xhol site designed to generate a PCR 
fragment that could be ligated into the pCombS vector in frame 
with the pelB leader sequence. The 3' primers (arrows) con- 
tained a sequence encoding a flexible linker (GGGS) and an 
Xbal site to generate a fusion gene with the major cpVIII upon 
ligation. The PCR fragments were digested with Xbal and Xhoh 
purified and directionaily cloned into the phagemid vector 
pComb8. (B) The recombinant TCRa-pComb8 phagemids 
contained an amp' gene utilized to facilitate drug selection of 
recombinant phage. Production of the recombinant 
TCRa/cpVIII fusion protein was driven by the inducible LacZ 
promoter following transformation of competent XL 1 -Blue 
cells (Stratagene). Transport into the periplasmic space during 
production was mediated by the pelB leader sequence. (Q The 
predicted structure of the recombinant TCRa/cpVIII fusion 
proteins consisted of a TCRa polypeptide and a flexible linker 
(GGGS) fused to the N- terminus of the cpVIII molecule. The 
pelB leader sequence at the N-terminal end is cleaved off fol- 
lowing transport into the periplasmic space. The recombinant 
fusion protein was incorporated into the viral capsid coat dur- 
ing phage assembly to generate recombinant M13 phage dis- 
playing TCR molecules on the surface. 



vector only) or the Al.l TCR VJa construct, which both 
lack the Ca domain recognized by the H28.710.16 anti- 
body used for detection. 
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Fig. 2. Expression of TCRa/pComb8 recombinant fusion 
proteins. (A) Western blot analysis to delect expression of A 1 . 1- 
TCR-VJCa/cpVIH fusion proteins in £. coli. Ly sates from 
recombinant TCR*-pComb8 (lane 1: Al.i-TCR VJ; lane 2: 
AI.l-TCR-VJQ or parental pCorob8 (lane 3: vector only) 
phagemkl transformed XL 1 -Blue cells were subjected to SDS- 
PAGE and immunoblotted with an anti-TCR Ca antibody 
(H28.710.16). The position of the band migrating at the pre- 
dicted molecular weight of the TCR-VJGx/cpVIH fusion pro- 
tein (M r 36.5 kD) is indicated by the arrow. (B) Display of 
TCR-YJCa polypeptides on the surface of recombinant phage. 
Binding of recombinant phage to anti-TCR antibody was uti- 
lized to detect the presence of TCR on the phage surface. 
Recombinant TCRct-pCombS phage or parental pComb8 
phage (indicated in the figure) were panned on plates coated 
with antibodies specific for TCRCcr (H28.710.16, solid bars) or 
TCR V06 (H57-597. hatched bars). The number of bound 
phage was a ssesse d by counting the number of duted amptdlltn 
resistant CFU following extensive washing as described in the 
experimental procedures. 



The presence of TCRa molecules displayed on the sur- 
face of recombinant phage was assessed using anti-TCR 
antibodies. Recombinant phage were "panned" on plas- 
tic dishes (Parmley and Smith, 1988) coated with either 
anti-TCR Ca (H28-710.16) or anti-TCR Cfi (H57-597) 
antibodies (Fig. 2B). Following extensive washing, the 
bound phage were eluted with a low pH buffer. Sub- 
sequent infection of host cells with eluted phage was 
readily achievable as the cpIII proteins, utilized for 
adsorption to the F' episome and infection of E. coli f 
remain intact following acid elution. Since the pComb8 
phagemid vector contains an amf gene, the number of 
bound phage could be assessed by counting the number 
of ampicillin-resistant CFU following infection of host 
cells with eluted phage, in addition to recombinant phage 
displaying A 1 . 1 TCRa chains, we also constructed phage 
displaying the TCRa chains from the 5C.C7 T eel! hybrid- 
oma (specific for pigeon cytochrome C peptide restricted 



to I-E k ) (Fink et a/., 1986). As shown in Fig. 2B, both 
the A 1.1 TCR-VJCa as well as the 5C.C7-TCR-VJCa 
recombinant phage were readily bound by the anti-TCRa 
but not the anti-TCR^ antibodies. As predicted, the par- 
ental pComb8 phage (vector only) and recombinant 
TCR-VJa phage showed no binding to either antibody 
as these phage lack the TCR Cot determinant recognized 
by H28.710.16 antibody. Thus, polypeptides bearing 
TCR Ca specific determinants were effectively displayed 
on the surface of recombinant phage particles, indicating 
that TCRa/cpVIII fusion proteins could be effectively 
incorporated into the capsid coat during phage assembly. 

Specificity of TCR displayed on recombinant phage 

The A 1 . 1 T hybridoma cells recognize a synthetic poly- 
peptide with the sequence poly [EYK(EYA) 5 ) (poly 18) 
presented by I-A d (Fotedar et a/., 1985). The antigenic 
fine specificity of A 1 . 1 cells for a series of poly 1 8-related 
peptide analogues was assessed by measuring the IL-2 
responses to peptide pulsed BALB/c (I-A d ) spleen cells. 
As shown in Fig. 3 A, A 1.1 cells produced cytokine in 
response to two of the peptides, EYK(EYA) 4 and EYK- 
(EYA) 4 EYK. Peptide analogues, substituted with ala- 
nines at residues 3 or 10, failed to stimulate A 1.1 cells. 
These results are in agreement with those reported by 
others for this cell line (Fotedar et aJ., 1985). The anti- 
genic fine specificity of the A 1.1 hybridoma cells for the 
four peptides described above provided a model with 
which to compare specificity of TCRa molecules dis- 
played on recombinant phage. 

The poly 18-related peptide analogues utilized to 
characterize the specificity of A 1.1 hybridomas were 
coated onto 96-well plates for panning experiments to 
assess the binding capacity and specificity of the TCRa 
molecules displayed on the phage surfaces. Bound phage 
were extensively washed, eluted and then quantitated by 
counting the number of ampicillin-resistant CFUs fol- 
lowing infection of host cells. Representative of a series of 
10 binding experiments, Fig. 3B shows that recombinant 
phage expressing A 1.1 TCR-VJCa preferentially bound 
to the antigenic peptides, EYKEYAEYAEYAEYAEYK 
and EYKEYAEYAEYAEYA, but not to non-antigenic 
peptides with an alanine substitution at residues 3 (EYA- 
EYAEYAEYAEYA) or 10 (EYK EYAEY A*4 Y AEYA). 
The parental pComb8 phage (vector only) and recom- 
binant phage displaying 5C.C7-TCR-VJCa showed no 
specific binding to any of the four poly 18 peptide ana- 
logues (Figs 3B and 4A). Recombinant phages displaying 
A 1.1 TCR VJa, which lack the Ca-domain, also showed 
preferential binding to the antigenic peptides, indicating 
that the specific binding was mediated by the TCR Va- 
and Ja-region domains (Fig. 4B). The observed pref- 
erential binding to antigenic peptide was reproducible 
and consistent over a wide range of phage titres when 
we collected the quantitative data from seven different 
experiments utilizing several different recombinant phage 
preparations from two different recombinant constructs 
(Fig. 4C). Thus, direct binding by phage displaying Al. I 
TCRa to peptides reflects the antigenic specificity of the 
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Fig. 3. Antigenic fine specificity of A 1.1 hybridoma cells and 
Al.l-TCRa Recombinant phage. (A) Fine specificity of Al.l 
hybridoma cells. The ability of poly IS related peptide ana- 
logues to induce an IL-2 response was utilized to assess the fine 
specificity of Al.l hybridoma cells. The BALB/c spleen cells (1- 
A*) were pulsed with the indicated poly 18 related peptide 
analogues and co-cultured with Al.l hybridoma cells as 
described in the experimental procedures. The relative amount 
of IL-2 produced in response to the peptide antigens is indicated 
in the K-axis as the amount of tritiated thymidine uptake in a 
CTLL assay. (B) Comparative fine specificity of peptide binding 
by recombinant Al.l-TCR phage. The binding capacity of 
recombinant phage displaying Al.l-TCR-VJCat was assessed 
using the same four poly 18 related peptide analogues used to 
assess the fine specificity of the Al.l hybridoma cells. Plates 
were coated with the indicated peptide analogues followed by 
panning with recombinant phage displaying Al.l TCR-VJd. 
The number of bound phage following extensive washing was 
assessed by determining the number of eluted ampicillin resist- 
ant CFU. 

Al.l hybridoma from which the TCRa cDNA sequence 
was derived (see Fig. 3A), despite the absence of class II 
MHC presentation. 

Affinity selection of specific TCRa 

To assess the capacity for affinity selection of phage 
displaying specific TCR polypeptides, recombinant 
phage displaying Al.l TCR-VJCa were mixed 1:100 or 
1:1000 with phage displaying 5C.C7-TCR-VJCa and 
then subjected to two rounds of affinity enrichment on 
plates coated with antigenic poly 18 peptide analogues. 
The eluted phage were used to infect host cells and col- 
onies from ampicillin plates were lifted onto nitro* 
cellulose filters. The colony-lifts were lysed and 
hybridized with a DNA probe specific for the A 1 . 1 TCRa 



sequence. We found that 85% (468/554) of the colonies 
from the 1:100 mix and 76% (113/149) of the colonics 
from the 1:1000 mix contained Al.l TCRa specific 
sequences (Fig. 5). Thus, two rounds of selection enriched 
the phage displaying Al.l TCRa by over 500- and 3000- 
fold, respectively. These results suggest that the phage- 
display system can be utilized to select for T cell receptors 
capable of binding directly to peptide ligands and has 
potential to be utilized as a gene identifying system for 
TCRa chains with relatively high affinity for their peptide 
ligand. 

The TCRa binding directly to protein antigen 

While phage displaying 5C.C7-TCR-VJCa railed to 
bind to the poly 18 peptide analogues bound by Al.l 
TCRa recombinant phage, these phage also failed to bind 
antigenic cytochrome C peptide, in repeated attempts 
(data not shown). Thus, differences in detectable binding 
of TCR Va to specific antigenic peptides may represent 
variance in the particular TCR-ligand interactions such 
that only a subset of TCR, including that on Al.l cells, 
have direct interactions strong enough to be detectable 
in this system. Accordingly, we constructed recombinant 
phage displaying TCRa from the 3B3 T cell hybridoma, 
which has previously been suggested to express a TCRa 
chain product capable of direct interaction with antigen 
(Mori et al., 1993). The 3B3 hybridoma was generated 
from BALB/c mice immunized with bee venom phos- 
pholipase A 2 (PLAz) and is specific for a bee venom 
PLA 2 derived peptide (residues 19-34) in the context of 
I-A d (Mori et aJ. f 1993). Two aspects of 3B3 TCR binding 
characteristics made it particularly interesting for phage 
display studies of direct antigen-binding TCRa chains. 
Firstly, three-dimensional structural analysis of bee 
venom PLA 2 indicated that the antigenic peptide residues 
associated with 3B3 TCR binding are exposed to the 
outer surface of the PLA 2 molecule in the form of a 
flexible loop (Scott et al. % 1990). Secondly, previous pep- 
tide competition studies of 3B3 TCR specificity suggested 
that the TCRa chain may be able to bind directly to 
unprocessed bee venom PLA 2 (Mori et a/., 1993). There- 
fore, we constructed recombinant phage displaying 3B3- 
TCR-VJCa to test the ability of this TCRa chain to 
directly bind to unprocessed bee venom PLA 2 protein in 
a specific manner. 

The PCR-generated fragments encoding the 3B3- 
TCR-VJCa polypeptide were subcloned into the 
pComb8 vector for production of recombinant phage. 
Bee venom PLA 2 was coated onto 96-well prates for pan- 
ning experiments to assess the binding capacity and speci- 
ficity of the 3B3 TCRa chains displayed on the phage 
surface. We observed binding of recombinant 3B3-TCR- 
VJCa phage, but not parental pComb8 phage (vector 
only), to bee venom PLA 2 coated wells (Fig. 6A). Fur- 
thermore, binding of recombinant 3B3-TCR-VJCa 
phage to BSA (3% BSA, used for blocking) was not 
observed. Moreover, the recombinant phage displaying 
3B3 TCR-VJCa did not bind to bovine PLA 2 (Fig. 6B), 
which lacks the exposed hydrophilic loop containing the 
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Fig^4. Antigenic binding is mediated by the V j regions of specific TCR-recombinant phage (A) The 
J?^ g T ^ of Parental pCombS phage or recombinant phage displaying AU-TCR-VJCx or 
5C CT-TCR-VJCaj for the indicated poly 1 8 related peptide analogues was assessed using the pining 
pro ocol described in the experimental procedures. Plates were coated with the indicated peptide 
analogues and panned with the indicated phage. The number of bound phage following extensive 
washmg was assessed by determining the number of ampiciUin resistant CFU/pI (B) The binding 
capacity of recombinant phage displaying TCR VJa (truncated C-region) from the Al.l or 5CC7 
hybndoma cells to the four indicated poly 18 related peptide analogues was determined as described 
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antigenic peptide sequence associated with 3B3 TCR 
binding (Dijkstra et a/., 1981; Mori ct al. y 1993). The 
observed preferential binding to antigenic bee venom 
PLA 2 was reproducible and consistent over a wide range 
of phage titres when we collected the quantitative data 
from five different experiments utilizing several different 
recombinant phage preparations (Fig. 6C). Thus; the 
phage display system enabled us to assess the novel 
capacity Of a TCRa chain to bind directly to antigenic 
protein in the absence of MHC 



In order to further assess the specificity of the TCRa 
recombinant phages for their respective antigenic ligands, 
we did parallel panning experiments with the Al.l TCR- 
VJCa and 3B3 TCR-VJCa recombinant phage (Fig. 7). 
Phage displaying Al.l TCR-VJCa showed binding to 
thc antigenic poly 18 peptide analogue (EYK- 
EYAEY AEY AEY AEYK) but not to bee venom PLA 2 . 
In contrast, phage displaying 3B3 TCR-VJCa dem- 
onstrated specific binding to bee venom PLA 2 but not to 
the poly 18 peptide analogue. Thus, the TCR phage- 



1394 



T. ONDA etai. 




C7a Pb»ge 

ALla*7Tot*l 
3/116 = <3% 



Al ia Ph»gt 

AUa + /ToUl 
63/70 = 90% 



Starting: Al.Ia/C7a = 1/100 
Selected 2x on EYK(EYA>4 

A 1.1 a* /Total 
468/55-1 = 85% 



Starting: Al.la/C7a * 1/10O0 
Selected 2x on EVK(EYA)4 

AUla + yToUl 
11*149 = 76% 



Fig. 5. Affinity selection of specific TCR on recombinant phage. 
A mixing experiment was done to assess the capacity for affinity 
selection of peptide specific TCR displayed on the surface of 
recombinant Al.I-TCRfl-pComb8 phage. The Al.l-TCR- 
VJCce phages were mixed 1:100 or 1:1000 with 5C.C7-TCR- 
VJCat phage and subjected to two rounds of affinity selection 
by panning as described in the experimental procedures. Colony 
lifts were hybridized with a radiolabeled A 1.1 -TCRa specific 
oligonucleotide probe. Autoradiographs of the hybridized rep- 
lica niters are shown in panels A-D. Autoradiographs of control 
colony lifts from plates inoculated with unmixed 5CC7 TCRa 
recombinant phage (panel A) or unmixed A 1.1 -TCRa recom- 
binant phage (panel B) are shown for comparison with plates 
from mixing experiments starting with Al.lot/5C.C7a ratios of 
1:100 (panel Q or 1:1000 (panel D) prior to the two rounds of 
affinity selection. 



display system described here facilitated the study of 
TCRa chains capable of binding directly to antigenic 
epitopes in a specific manner. 

DISCUSSION 

The phagt>display system (Smith, 1985) has proved to 
be extremely useful for the study of specific recepior- 
ligand interactions (Winter et a/., 1994). The advantage 
of this system for immunological studies is that the dis- 
play of receptors on recombinant phage provides a means 
to link directly antigen recognition structures and the 
genetic instructions encoding the receptor (Kang et a/., 
1991). Consequently, the phage display system has been 
invaluable for the characterization and selection of anti- 
body specificities for antigenic ligands. Here, we have 

shown that this system can be utilized further to study 
specific binding interactions of constituent TCR poly- 
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Fig. 6. Antigenic specificity of recombinant phage displaying 
TCRa polypeptide from 3B3 hybridoma celts." (A) The binding 
capacity of parental pCombS phage or recombinant phage dis- 
playing TCR-VJCr from the 3B3 hybridoma cell to bee venom 
PLA 2 was assessed using the panning protocol described in the 
experimental procedures. Plates were coated with the indicated 
concentration of bee venom PLA 3 and then panned with 10* 
CFU of recombinant phage displaying 3B3-TCR-VJCs or par- 
ental pCombS phage. The number of bound phage following 
extensive washing was assessed by determining' the number of 
ampicillin resistant CFU//J. (B) The specificity of recombinant 
phage displaying TCR-VJOr from the 3B3 hybridoma cell for 
bee venom PLA 3 and bovine PLA 2 was assessed using the pan- 
ning protocol. The number of bound phage following panning 
with 10 10 CFU of recombinant phage and extensive washing 
was ayyfy* by determining the number of ampicillin resistant 
CFU/jd. (C) The relative binding specificity of several recom- 
binant phage preparations, displaying 3B3-TCR-VJCa for bee 
venom PLA 2 or bovine PLA 2 arc compared from five different 
panning experiments as described above. 
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peptides to antigenic ligands. Specifically, the phagc-dis- 
play system facilitated a novel demonstration of direct 
binding by TCRa chains to peptide or protein antigens 
in a specific manner. These results indicate that for some 
TCRa chains the binding affinity to antigenic peptide 
may be sufficiently high that it can be detected in the 
absence of the TCR0 chain, and in an MHC-independcnt 
manner. It has been postulated previously that TCR 
binding to peptide by Va- and V/*-subunits may be rela- 
tively autonomous and that the resulting TCR-peptide 
interactions can dramatically influence TCR-MHC inter- 
actions; indicating the "primacy" of TCR-peptide inter- 
actions (Ehrich el a/., 1993). Our results extend these 
findings by demonstrating that the dominant interactions 
of certain TCRa chains for peptide antigens may be 
sufficiently high that they can be analysed independently. 
However, these interactions are quite unusual in that they 
do not require the expression of the second TCR subunit 
or normal MHC and coreceptor interactions. These 
results raise the concern that this model docs not reflect 
typical TCR-iigand interactions. Indeed, gene transfer 
studies of Al.l TCRa chain sequences indicated that the 
expression of the a chain alone was not sufficient to confer 
responsiveness to peptide antigen alone or in the context 
of I-A 4 (H. Zheng, A. Fotedar and D.R. Green; unpub- 
lished data). Furthermore, activation of Al.l bybridoma 
cells required specific peptide antigen to be presented in 
the context of the appropriate MHC (I-A d ) (Fotedar el 
a/., 1985). The requirement for MHC presentation and 
natural TCRa/7 conformation for functional interaction 
of cell surface expressed receptors may be essential for 
bringing the respective binding sites into close enough 
proximity to establish the specific binding interaction 
between the residues of opposite charge which mediate 
binding. The specificity data presented here (Fig. 3 and 
Fig. 4 and Fig. 7) suggest that these requirements may 
not be necessary for some receptors to detect specific 



binding in non-cellular TCR phage-display systems. 
Therefore, the data presented here do not suggest that 
the TCRa chain alone would be sufficient for T cell rec- 
ognition and activation, but rather that the TCR phage- 
display system provides a means to explore the specific 
binding interactions that mediate TCR binding to antigen 
at a constituent level. While this study does not refute 
commonly held theories of TCR-antigen interactions, 
this system does provide a means to explore the innate 
specific binding interactions contributed by the con- 
stituent subunits of TCR complexes for the peptide com- 
ponent of MHC-peptide ligands. In the absence of 
detailed structural data for the TCR/peptide/MHC tri- 
molecular complex, models such as this can provide 
important insight into the nature of the elemental inter- 
actions that constitute TCR binding to MHC-peptide 
complexes. Furthermore, such information should 
facilitate interpretation of the structural data for the 
trimolecular complexes when this becomes avail- 
able. 

Previous studies indicate that TCR-peptide binding 
interactions are very specific, since single amino acid 
changes had profound effects on binding (Engel and Hed- 
ricV, 1988; Danska e\ a/., 1990). In accordance with these 
results, the direct binding of phage-displayed TCRa 
chains to antigen appeared to be specific. Recombinant 
phage-displaying Al.l TCRa chains bound to antigenic 
peptide analogues but not to mutated peptide analogues 
that failed to activate Al.l hybridoma cells (Fig. 3 and 
Fig. 4). Additionally, the cross-specificity experiment 
indicated that recombinant phage-displaying Al.l TCR- 
VJCa or 3B3-TCR-VJCa bound their respective antigens 
but did not cross-react (Fig. 7). 

The failure of repeated attempts to demonstrate spec- 
ific binding of recombinant phage-displaying 5C.C7 
TCRa to cytochrome C peptide indicated that only a 
subset of TCR Va have the capacity for direct interactions 
with antigen strong enough to be detectable in this 
system. Previous studies of 5C.C7 TCR interactions with 
peptide antigen indicate that specific binding was 
mediated by CDR3 residues from both the a and p chains 
(Jorgensen el aL, 1992a, 1992b). Thus, binding to antigen 
by receptors such as 5C.C7 TCR may require the com- 
bined binding interactions of the a. and fi chains to 
mediate strong specific binding, while antigen binding by 
receptors such as Al.l and 3B3 TCR may be dominated 
by specificities mediated predominantly by the TCRa 
chain. 

The phage-display system described here extends the 
study of T cell recognition by providing a means to sim- 
plify the study of the specific interactions that constitute 
the overall receptor-ligand interactions. The power of 
the phage-display system is that recombinant phage can 
be amplified and subjected to multiple rounds of affinity 
selection as they contain the genetic instructions for pro- 
duction of the receptor molecule. A system for selection 
and cloning of antigen-binding TCR molecules will be 
useful in exploring TCR binding to various antigens and 
should provide a means to more closely examine TCR- 
ligand interactions. 



1396 



T. ONDA et al. 



Acknowledgements— We thank Dr K. Ishizaka and Dr A. Kang 
for helpful discussions. We thank Dr N. Gascoigne for critically 
reviewing the manuscript. We also thank Dr D. Burton (The 
Scripps Research Foundation) for the pComb8 phage display 
expression vector and Dr S. Hedrick (University of California, 
San Diego, U.S.A.) for the 5C.C7 TCRa cDNA sequence. This 
work was supported in part by National Institutes of Health 
grant GM 52735. D.M.L. was a Myasthenia Gravis Foundation 
Fellow. 



REFERENCES 

Ajitkumar P., Geier S. S.,Kcsari K. V., Borriello F., Nakagawa 
M., Bluestone J., Saper M. A., Wiley £>. C and Nathenson 
S. G. (1988) Evidence that multiple residues on both the 
• a helices of the class I MHC molecule are simultaneously 
recognized by the T cell receptor. Cell 54, 47-56. 

Barbas C F. r Kang A. S.» Lerner R. A. and Ben Iconic S. J. 
(1991a) Assembly of combinatorial antibody libraries on 
phage surfaces: The gene III site. Proc. natn. Acad. Sci. 
U.S.A. 88, 7978-7982. 

Barbas C. F. and Lerner R. A. (1991b) Combinatorial immu- 
noglobulin libraries on the surface of phage (Phabs): Rapid 
selection of antigen-specific Fabs. Metfi. Compan. Meth. 
EnzymoL 2, 119-124. 

Bissonnettc R., Zheng H„ Kubo R. T., Singh B. and Green D. 
R. (1991) A T helper cell hybridoma produces an antigen- 
specific regulatory activity. J. tmmun. 146, 2898-2907. 

Chien Y.-H. and Davis M. M. (1993) How afi T-cell receptors 
'see* peptide/MHC complexes. Imrmm. Today 14, 597-602. 

Danska J. S.. Livingstone A. M., Paragas V. t Ishihara T. and 
Fathman C. G. (1990) The presumptive CDR3 regions of 
both T cell receptor a and p chains determine T cell specificity 
for myoglobin peptides. J. exp. Med. 172, 27-33. 

Davis M. and Chien Y.-H. (1993) Topology and affinity of T- 
cell receptor mediated recognition of peptide-MHC 
complexes. Curr. Opin. Immun. 5, 45-49. 

Davis M. M. and Bjorkman P. J. (1988) T-cell antigen receptor 
genes and T-cell recognition. Nature 334, 395-402. 

Dijkstra B. W„ Kaik K. H. t Hoi W. G. J. and Drenth J. (1981) 
Structure of bovine pancreatic pbospholipase A 2 at 1.7 A 
resolution. /. molec. Biol. 147, 97-123. 

Ehrich E. W. f Devaux B M Rock E. P., Jorgensen J. L., Davis 
M. M. and Chien Y. H. (1993) T cell receptor interaction 
with peptide/major histocompatibility complex (MHQ and 
superantigen/MHC ligands is dominated by antigen. /. exp. 
Med. 178,713-722. 

Engel 1. and Hedrick S. M. (1988) Site-directed mutations in 
the VDJ junctional region of a T cell receptor ff chain cause 
changes in antigenic peptide recognition. Cell 54, 473-484. 

Esch T. R. and Thomas D. W. (1990) la-independent binding 
of peptide antigen to the T cell receptor. /. Immun. 145, 3972- 
3978. 

Fetici F. t Casta gnoli L. t Musacchio A., Jappelli R. and Cesareni 
G. (1991) Selection of antibody ligands from a large library 
of oligopeptides expressed on a multivalent exposition vector. 
X molec. Biol. 222, 301-310. 

Fink P. J., Matis L. A., McKlligott D. L., Bookman M. and 
Hedrick S. M. (1986) Correlations between T-cell specificity 
and the structure of the antigen receptor. Nature 321, 219- 
226. 

Fotedar A., Boyer M., Smart W., Widtman J., Fraga E. and 
Singh B. (1985) Fine specificity of antigen recognition by T 
cell hybridoma clones specific for poly- 18: a synthetic poly- 



peptide antigen of defined sequence and conformation. J. 
Immun. 135, 3028-3033. 

Gascoigne N. R. and Ames K. T. (1991) Direct binding of 
secreted T-cell receptor beta chain to superantigen associated 
with class II major histocompatibility complex protein. Proc. 
natn. Acad. Sci. U.S.A. 88, 613-616. 

Green D. R., Bissonnctte R., Zheng H., Onda T., Echcverri F M 
Mogil R. J., Steele J. K., Voralia M. and Fotedar A. (1991) 
Immunoregulatory activity of the T-cdl receptor a chain 
demonstrated by retroviral gene transfer. Proc. natn. Acad. 
Sci. U.S.A. 88, 8475-8479. 

Greenwood J., Willis A. E. and Perham R. N. (1991) Multiple 
display of foreign peptides on a filamentous bacteriophage. 
Peptides from Plasmodium falciparum circumsporozoite pro- 
tein as antigens. J. molec. Biol. 220, 821-827. 

Haunun J. S., Arsequeil G., Ldlouch A. C, Wong S. Y. C, 
Dwek R. A., McMkhad A. J. and Elliot T. (1994) Rec- 
ognition of carbohydrate by major histocompatibility com- 
plex class 1-reslrictcd glycopcptidc-specific cytotoxic T 
lymphocytes. J. exp. Med. 180, 739-744. 

Jorgensen J. L., Esser U., Groth B. F. i. S., Reay P. A. and 
Davis M. M. (1992a) Mapping T-cell receptor-peptide con- 
tacts by variant peptide immunization of single-chain trans- 
genics. Nature 355, 224-230. 

Jorgensen J. L., Reay P. A., Ehrich E. W. and Davis M. M. 
(1992b) Molecular components of T-cell recognition. A. Rev. 
Immun. 10, 835-S73. 

Kang A. S. f Barbas C. F., Janda K. D., Benkovic S. J. and 
Lerner R. A. (1991) Linkage of recognition and replication 
functions by assembling combinatorial antibody Fab librar- 
ies along phage surfaces. Proc. natn. Acad. Sci. U.S.A. 88, 
4363-4366. 

Kuchroo V. K., Byrne M. C M Atsumi Y., Greenfield E., Con- 
nolly J. B., Whitters M. J. Jr, R. M. O. H. Collins M. and 
Dorf M. E. (1991) T-cell receptor a chain plays a critical role 
in antigen-specific suppressor cell function. Proc. natn. Acad. 
Sci. U.S.A. *8, 8700-8704. 

Michaelsson E., Malmstrom V., Reis S., Engstrom A., Bur- 
khardt H. and Holmdahl R. (1994) T cell recognition of 
carbohydrates on type 11 collagen. J. exp. Med. 180, 745- 
749. 

Mori A., Thomas P., Tagaya Y., lijima H„ Grey H. and Ishi- 
zaka K. (1993) Epitope specificity of bee venom pbos- 
pholipase A 2 -spectfic suppressor T cells which produce anti- 
gen-binding glycosylation inhibiting factor. Int. Immun. 5, 
833-842. 

Parmley S. F. and Smith G. P. (1988) Antibody-selectable fila- 
mentous fd phage vectors: affinity purification of target genes. 
Gene 73, 305-318. 

Peccoud J., Dellabona P ( Allen P., Benoist C. and Mathis D. 
(1990) Delineation of antigen contact residues on an MHC 
class II molecule. EMBO J. 13, 4215-4223. 

Porcelli S. t Morita C. T. and Brenner M. B. (1992) CD 10 
restricts the response of human CD4"8" T lymphocytes to a 
microbial antigen. Nature 360, 593-597. 
. Rao A., Faas S. J. and Cantor H. (1984a) Activation specificity 
of arsonate-reactive T cell clones. /. Exp. Med. 159, 479-494. 

Rao A., Ko W. W.-P., Faas S. J. and Cantor H. (1984b) Binding 
of antigen in the absence of histocompatibility proteins by 
arsonate-reactive T-cell clones. Cell 36, 879-888. 

Sambrook J. f Fritsch E. F. and Manjatis T. (1989) Molecular 
Cloning: a Laboratory Manual. Cold Spring Harbor Lab- 
oratory Press, New York. 

Scott D. L. ( Otwinowski Z., Gclb M. H. and Sigler P. B. (1990) 
Crystal structure of bee-venom pbospholipase A 2 in a com- 
plex with a transition-state analogue. Science 250, 1 563-1 566. 



A phage display system for detection of T sell receptor-antigen interactions 



1397 



Siliciano R. F., Hemesath T*. J.. Pratt J. C. Dintzis R. Z., 

Dintzes H. M.. Acuto O., Shin H. S. and Reinherz E. L. 

(1986) Direct evidence for the existence of nominal antigen 

binding sites on T cell surface Ti ct-fi heterodimers of MHC- 

restrictedTcell dones. CW/47, 161-171. 
Smith G. P. (1985) Filamentous fusion phage: Novel expression 

vectors that display cloned antigens on the virion surface. 

Science 228, 1315-1317. 
White J., Herman A. t Pullen A. M., Kubo R. f Kappler J. 

W. and M arrack P. (1989) The V beta- specific supcrantigen 



imiphylococcal enterotoxin B: stimulation of mature T cells 
and donaJ deletion in neonatal mice. Cell 36, 27-35. 

White J. t Pullen A., Choi K., Marraclt P. and Kapplcr J. W. 
(1993) Antigen recognition properties of mutant V/T3* T 
cell receptors are consistent with an immunoglobulin-Iike 
structure for the receptor. /. exp. Med. I77 f 1 19-125. 

Winter G M Griffiths A. D., Hawkins R. E. and Hoogenboom 
H. R. (1994) Making antibodies by phage display technology. 
A, Rev. Immun. 12; 433-455. 



Vol. 91, pp. 12654-12658, December 1994 
Immunology 

Functional three-domain single-chain T-cell receptors 

Shan Chung^, Kai W. Wucherpfennig1\ Steven M. Friedman*, David A. HaflerU, 
and Jack L. Strominger^ 

^Department of Biochemistry and Molecular Biology, Harvard University, Cambridge, MA 02138; ^Department of Medicine, Cornell University Medical 
College, New York, NY 10021; and 'Center for Neurologic Diseases. Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115 

Contributed by Jack L. Strominger, July 19, 1994 



ABSTRACT T-cell receptors (TCRs) are membrane an- 
chored heterodimers structurally related to antibody mole- 
cules. Single-chain antibodies can be engineered by linking the 
two variable domains, which fold properly by themselves. 
However, proper assembly of the variable domains of a human 
TCR (V„ and Vp) that recognize the HLA-DR2b/mye!in basic 
protein-(85-99) peptide complex was critically dependent on 
the addition of a third domain, the constant region of the TCR 
P chain (Cp), to the single-chain construct. Single-chain mol- 
ecules with the three-domain design, but not those with the 
two-domain design, expressed in a eukaryotic cell as chimeric 
molecules linked either to glycosyl phosphatidyiinositol or to 
the transmembrane/cytoplasmic domains of the CD3 f chain 
were recognized by a conformation-sensitive monoclonal anti- 
body. The chimeric three-domain single-chain TCR linked to 
CD3 f chain signaled in response to both the specific HLA- 
DR/peptide and the HLA-DR/superantigen staphylococcal 
enterotoxin B complexes. Thus, by using this three-domain 
design, functional single-chain TCR molecules were expressed 
with high efficiency. The lipid-linked single-chain TCR was 
sohibilized by enzymatic cleavage and purified by affinity 
chromatography. The apparent requirement of the constant 
domain for cooperative folding of the two TCR variable 
domains may reflect significant structural differences between 
TCR and antibody molecules. 



T-cell receptor (TCR) recognition of antigen fragments pre- 
sented by major histocompatibility complex (MHC) mole- 
cules is a critical step in the initiation of a specific immune 
response (1, 2). The TCR a and 0 chains are each composed 
of two immunoglobulin-Iike domains; most of the amino acid 
residues that are found to be highly conserved in the variable 
(V) region of immunoglobulins are also found in TCR V 
regions, suggesting that the tertiary structure of the TCR may 
resemble that of immunoglobulins (3, 4). However, TCR V 
regions have significantly more primary sequence variability, 
an increased apparent rate of divergence in phylogeny, and 
peaks of variability in addition to those noted in immuno- 
globulins (5, 6). In order to understand and control the 
molecular interactions underlying T-cell recognition of 
MHC/peptide complexes; complete structural knowledge of 
the TCR is required. 

Several approaches have been employed to produce sol- 
uble, recombinant TCRs. In these recombinant TCR mole- 
cules, the transmembrane/cytoplasmic regions of a and j3 
chains were replaced with sequences from lipid-linked pro- 
teins (7), the CD3 f chain (8), or immunoglobulins (9, 10). 
Soluble TCRs were either recovered as secreted proteins or 
obtained by enzymatic cleavage of the surface-expressed 
recombinant proteins. All of these approaches rely, however, 
on the assembly of the heterodimer, which is inefficient (11). 
In addition, high-level expression of the human TCR a chain 
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in transfected eukaryotic cells is not stable. These problems 
can be avoided by the design of a single-chain (sc) recombi- 
nant protein in which the V regions of the heterodimer are 
joined by a short peptide linker. Such a design has been 
successfully applied to antibody molecules (12). Such recom- 
binant molecules, scFv, have a specificity and affinity similar 
to that of native antibodies (12). Several reports have de- 
scribed the production of scTCRs in bacterial expression 
systems using the sc antibody (Fv) design (13-16), but none 
have presented functional data indicating that these scTCRs 
could recognize their MHC/peptide complexes or superan- 
tigens. Recently, however, the production in bacteria of a 
scTCR with the two-domain Fv design that could recognize 
its natural ligands has been reported, although the fraction 
correctly refolded was extremely low (17). 

In the present report, different scTCR designs were eval- 
uated in transfected eukaryotic cells with respect to surface 
expression of TCR molecules, proper folding, and recogni- 
tion of the appropriate MHC/peptide ligand. A three-domain 
sc construct [or-chain V (Vji-Iinker-0-chain V (V^/J-chain 
constant (C^)] was stably expressed on the cell surface when 
linked to a glycosyl phosphatidyiinositol (GPI) anchor and 
recognized by a conformation-dependent monoclonal anti- 
body (mAb) specific for the V^17 segment. The soluble form 
of this recombinant protein could be readily obtained by 
enzymatic cleavage with phosphatidylinositol-specific phos- 
pholipase C (PI-PLC). Replacement of the GPI domain with 
the cytoplasmic portion of the f chain resulted in a functional 
TCR molecule that transduced an intracellular signal follow- 
ing recognition of either the proper MHC/peptide or the 
MHC/staphylococcal enterotoxin B (SEB) superantigen 
complex. The production of a functional scTCR directly 
demonstrates the feasibility of employing sc design to pro- 
duce soluble TCRs. 

MATERIALS AND METHODS 

Construction of Recombinant TCR Molecules. cDNAs of 
TCR a and /3 chains were prepared from mRNA of Hy.2H9 
cells (18) with Superscript reverse transcriptase (BRL) and an 
oligo(dT) primer (Sigma) and were amplified by PCR using 
Vent DNA polymerase (New England Biolabs) and primers 
5'-GCTCGAGGCGGCGATGGAAACTCTCCTGGGAGT-3' 
(A5) and 5'-GGAATTCAGCTGGACCACAGCCGC-3' for 
a-chain and 5'-GCTCGAGCTCTGCCATGGACTCCTG- 
GA-3' and 5 ' -GG AATTC AG AA ATCCTTTCTCTTG AC-3 ' 
for 0-chain, The cDNAs were cloned into the mammalian 
expression vector pBJ-neo (8). GPI-anchored TCR molecules 
(a-PI and 0-PI) were constructed as follows. A Ban I site was 



Abbreviations: TCR, T-cell receptor; MHC, major histocompatibil- 
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12654 



A IKS*-. J >Utl. At t_> <~ t. WWn ^4 |I/^T/ 



engineered after the fifth amino acid residue beyond the last 
cysteine by oligonucleotide-directed mutagenesis. The region 
3' of the Ban I site was then replaced with a Ban 1-Not I 
fragment encoding the GPI signal domain from the human 
placental alkaline phosphatase. For the construction of vari- 
ous scTCRs, variable domains of the 2H9 TCR a and p chains 
were prepared by PCR using primers A5 and 5'-CAGAGCT- 
CACGG ATGAACA ATAAGGCTGGT-3 ' for the V a domain 
in all the scTCR constructs, 5'-TCGGATATCGATGGTG- 
GAATCACTCAGTCC-3' (B5) and 5 ' -C AG AG ATC AG- 
CACGGTGAGCCGGTTCCC-3' for the V fi domain in AB- 
PM, 5 '-GTGGG AG ATCTCTGCTTCTG ATGGCTC AAAC 
and B5 for the V„ domain in AB-PI-2, 5'CACGGATC- 
CCCGTCTGCTCTACCCCAGGC and B5 for the and 
domains in ABC-PI, and 5'-CACGGATCCCCGTCTGCTC- 
TACCCCAGGC-3' and B5 for the and C p domains in 
ABC-f. The cDNA encoding the transmembrane and cyto- 
plasmic domains of murine CD3 £ chain (8) was a gift of R. D. 
Klausner (National Institutes of Health). Convenient restric- 
tion sites were engineered at the end of each fragment to aid 
in the assembly of theconstruct. The linker was a 15-amino 
acid motif of GGGGS repeated three times (12) with Sac I at 
the 5' end and EcoRV at the 3' end. Except for <*-PI, all the 
constructs were cloned into pBJ-neo, which carries the G418- 
resistance gene. a-PI was cloned into pCEP-4 (Invitrogen), 
which bears the hygromycin-resistance gene. All constructs 
were verified by multiple restriction digests and by sequencing 
with the Sequenase kit (United States Biochemical). 

Affinity Purification and Characterization of a Soluble 
Three-Domain scTCR. After transfection and G418 selection 
(8), cells expressing a high level of GPMinked three-domain 
scTCR (ABC-PI) were isolated by three rounds of sorting. 
The resulting cells were grown in spinner culture to a density 
of 10 6 per ml and harvested by centrifugation. The pellet was 
washed twice with phosphate-buffered saline (PBS) and 
resuspended in PBS containing 2 mM Pefabloc (Centerchem, 
Stamford, CI) to a density of 5 x 10 7 per ml with PI-PLC 
(Sigma) added at 1 unit/ml. Cells were incubated at 37°C for 

1 hr with constant rocking. The supernatant was collected by 
centrifugation and by passage through a 0.45- /xm filter and 
applied to a column of Acti-gel (Sterogen, Arcadia, CA) with 
immobilized The column was washed with 10 volumes 
of PBS and the soluble TCR was eluted with 0.15 M glycine 
(pH 2.8). Fractions were immediately neutralized with 0.1 
volume of saturated Tris. The soluble TCR was then dialyzed 
against >100 volumes of PBS at 4°C with at least four changes 
and concentrated to 0.5 rag/ml by vacuum dialysis against 
PBS. Five micrograms of purified soluble three-domain 
scTCR was analyzed by SDS/PAGE under reducing condi- 
tions. 

Stimulation of Transfectants with Antibodies, SEB, and 
Peptide/MHC Ligands. ABC-f-transfected BW51470 *~p~ 
(19) cells (5 x 10 4 per well) were cultured in a 96-well 
round-bottom plate to which various antibodies had been 
immobilized (1 /ig per well)* The supernatants were collected 
after 24 hr and interleukin 2 (IL-2) production was assessed 
in a bioassay using an IL-2-dependent cell line (CTLL) and 
the CellTiter-96 nonradioactive proliferation assay 
(Promega). In the case of ABC-f-transfected RBL-2H3 (8) 
cells, the cells were incubated with Pfflserotonin (NEN) at 
0.5 /iCi (18.5 kBq) for 24 hr before they were added to the 
antibody plate. After incubation at 37°Cfor 2 hr, radioactivity 
released into the supernatant was measured in a liquid 
scintillation counter. The specific serotonin release was 
calculated as described (8). For SEB stimulation, 5 x 10* 
transfected cells per well were cultured with various con- 
centrations of SEB (Toxin Technology, Sarasota, FL) in the 
presence or absence of 2 x 10 5 B cells. For antigen presen- 
tation, 5 x 10 4 transfected cells per well were cocultured with 

2 x 10 5 B cells which were incubated with or without the 



myelin basic protein (MBPM85-99) peptide for 3 hr before 
the experiment. The assays were conducted as described 
above. 

RESULTS 

mAb CI Recognizes a Conformational Epitope of TCR. 

Recombinant TCR molecules were generated by using the 
TCR of- and 0-chain sequences of the human MBP-specific 
T-cell clone Hy.2H9 (18). This clone TCR is composed of the 
V«3.1 and Vpl7,l segments and is specific for the immuno- 
dominant MBP peptide MBP-(85-99) in the context of HLA- 
DR2 (DRA, DRB1*1602) (18). Usage of the V p 17.1 segment 
allowed the proper folding of recombinant TCRs to be probed 
with the superantigen SEB (20) and the mAb CI (21). To 
confirm the V^17 specificity of CI, the extracellular domains 
of TCR a and p chains of Hy.2H9 cells were fused to the 
C-terminal sequence from human placental alkaline phospha- 
tase for GPI anchorage (Fig. 1) and the DNAs encoding the 
GPI-anchored ft and a chains (£-PI and ot-PI) were sequen- 
tially transfected by electroporation (8) into a TCR a- and 
^-chain-deficient murine lymphoma cell line, BW5147 aTf$~ 
(BW~) (19). The surface expression of the GPI-anchored 
TCR chains was monitored by staining with mAbs aFl (22), 
0F1 (23), and CI. aFl and 0F1 recognize nonconformational 
epitopes located in the C region of the TCR a and P chains, 
respectively. Surface expression of the GPI-anchored TCR p 
chain is independent of heterodimer formation and assembly 
of the CD3 complex (7). In the /5-PI-transfected cells (Fig. 2, 
open curves), high-level expression of /3-PI was confirmed by 
staining with 0F1. Interestingly, there was little CI staining 
of these transfectants. However, when a GPI-anchored 2H9 
or chain was supertransfected into the 0-PI transfectant (Fig. 
2, shaded curves), CI reactivity was greatly increased while 
the level of /3F1 staining remained constant. Thus, the 
Vpl7- specific CI epitope is conformational and dependent on 
the proper pairing of TCR a and p chains and can therefore 
be used to assess the proper folding of recombinant TCRs 
bearing a V^17 sequence. 
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Fio. 1. Schematic representation of TCR or- and 0-chain genes 
and various chimeric constructs. S — S, disulfide bond; L, leader; V, 
variable segment; J t joining segment; C, constant region; TM, 
transmembrane region; Cy, cytoplasmic region; ATG, start codon; 
Li, 15-residue peptide linker containing three repeats of GGGGS; PI, 
GPI domain of human placental alkaline phosphatase with the 
sequence LAPPAGTTDAAHPGRSWPALLPLLAGTLLLL (7). 
The ( region contains transmembrane and cytoplasmic domains of 
the murine CD3 ( chain starting at position 31 (8). 
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Fig. 2. Compression of a-PI and 0-PI are required for CI 
reactivity. Flow cytomeric analysis of 0-PI-transfected BW~ cells 
stained with mAb aFl, 0F1, or CI before (open curves) and after 
(shaded curves) the supertransfection of the a-PI construct. 

High-level expression of the TCR a chain (a-PI) was, 
however, not stable either alone or in the presence of 0-PI. 
Attempts were made on several cell lines, including COS-7, 
CHO-K1, and a TCR-deficient variant of Jurkat cells, JK-£~ 
(J.RT3-T3.5, American Type Culture Collection). The ex- 
pression level of a-PI was comparable to that of 0-PI after the 
initial drug selection, but continued culture for less than a 
month yielded a population of cells with little surface ex- 
pression of a-PI, whereas £-PI expression was stable (data 
not shown). The inability to obtain cell lines with stable 
high-level expression of the Pl-anchored human TCR a-chain 
has been reported by other laboratories as well (24). 

Expression and Purification of a Three-Domain scTCR. To 
overcome the limitations set by the unstable expression of the 
human TCR a chain, various sc designs were examined. 
Initially, a design similar to that of sc antibodies (Fv) was 
chosen (12). A 15-residue flexible linker was used to link the 
C terminus of the V*-!* domain to the N terminus of the 0 
chain. The GPI domain was then ligated to the C terminus of 
the Vp-Jp domain. The construct (AB-PI-1, Fig. 1) was 
transfected into several cell lines, including JK-j3", COS-7, 
CHO-K1, and BW". Although the expression of the gene was 
confirmed by the detection of the correct RNA transcripts 
(Fig. 3B), no surface expression was detected, as evidenced 
by negative CI antibody staining (Fig. 3A). Immunoprecip- 
itation after metabolic labeling failed to recover any Cl- 
reactive sc molecules from these transfectants (data not 
shown). The inability to identify any Cl-reactive protein 
could have been due to the design of this molecule, such as 
insufficient linker length between the extracellular domain 
and the GPI domain. To improve the accessibility of the sc 
construct, another two-domain scTCR was designed in which 
an extra 30-amino acid portion of the N terminus of the Cp 
domain was added as a hinge region. The transfectants of this 
construct (AB-PI-2, Fig. 1) were still not reactive with the CI 
antibody (data not shown). Finally, the entire C^ domain was 
added to the sc construct. A complete C^ domain should 
provide enough distance for the V^Vp domains to be ex- 
pressed on the cell surface and, more importantly, should 
allow surface expression to be monitored with another anti- 
body, pFl (23). This three-domain scTCR was constructed 
by extending the TCR £-chain sequences to the residue right 
before the last cysteine (the sixth cysteine), which was then 
fused to the GPI domain. The last cysteine was deleted to 
prevent dimerization between C^ domains. This construct 
(ABC-PI, Fig. 1) was transfected into B W~ cells and surface 
expression was confirmed by staining with both 0F1 and CI 
(Fig. 3C, shaded curves). Both antibodies stained the cells 
with comparable efficiency, suggesting that most of the 
molecules were expressed in the correct conformation. 
Moreover, the molecule could be efficiently cleaved from the 
cell surface with PI-PLC (Fig. 3C, open curves). 

Soluble three-domain scTCR was purified from transfec- 
tants after PI-PLC cleavage followed by affinity chromatog- 
raphy using the 0F1 antibody. The purified three-domain 
scTCR appeared as multiple bands at 50-70 kDa after SDS/ 
PAGE (Fig. 3D). The heterogeneity of scTCR is probably the 
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Flo. 3. Flow cytomeric analysis of surface expression of scTCR 
constructs. (A) Lack of CI reactivity in BW~ cells transfected with 
two-domain scTCR (AB-PI-1). (B) RNA analysis of poIy(A>enriched 
total cell RNA isolated from BW~ cells transfected with two-domain 
and three-domain scTCR constructs AB-PI-1 and ABC-PI, respec- 
tively. Samples were analyzed in a nuclease SI protection assay (25) 
using probes specific for the. 5' end of the transcripts from the TCR 
constructs. The coexpressed heo (G418-resistance gene) transcripts 
from the vector were analyzed with a probe at the same time as a 
control. (C) Flow cytometric analysis of ABC-PI-transfected BW~ 
cells with both fiFl and CI antibodies before (shaded curves) and 
after (open curves) PI-PLC treatment. (Z)) SDS/PAGE of affinity- 
purified three-domain scTCR. (E) Comparison of CI reactivity of 
three-domain scTCRs produced from eukaryotic (Euk) and prokary- 
otic (Prok) expression systems in a two-antibody ELISA. A plateau 
is reached because the amount of 0F1 attached to the plate became 
limiting. 

result of variable glycosylation; its polypeptide size calcu- 
lated from amino acid composition is 40 kDa. The structural 
integrity of the three-domain scTCR was verified by a two- 
antibody ELISA (Fig. 3£). The molecules were first captured 
by the 0F1 antibody immobilized to the plate and then 
assessed for reactivity with the CI antibody. When compared 
with the three-domain scTCR produced in a bacterial expres- 
sion system (unpublished work), the scTCR from the eukary- 
otic system gave 10-20 times higher CI reactivity. The 
purified three-domain scTCR was stable and could be stored 
in PBS at 4°C for months without significant loss of CI 
reactivity. 

Functional Characterization of a Chimeric Three-Domain 
scTCR. To directly assess the functional integrity of the 
three-domain scTCR, a self-signaling scTCR was produced 
by replacing: the GPI domain with the transmembrane and 
cytoplasmic domains of the CD3 £ chain. These regions have 
been shown to be sufficient for signal transduction when its 
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extracellular fusion partner is crosslinked by an antibody or 
by the proper ligand (8, 26, 27). To enable the recovery of 
three-domain scTCR as a soluble form, a linker containing a 
thrombin cleavage site was inserted into the junction of 
three-domain scTCR and the C domain. The construct 
(ABC-0 was transfected into BW~ cells (28) and the rat 
basophilic leukemia cell line RBL-2H3 (RBL) (8), and the 
populations displaying high-level expression of three-domain 
scTCR were isolated by three rounds of cytofluorometric 
sorting using the antibody /3F1. The ABC- (- transfected cells 
were first stimulated with various antibodies to confirm the 
self-signaling nature of this recombinant molecule. The signal 
transduced upon the activation of the three-domain scTCR 
was measured as IL-2 production in BW~ transfectants, 
whereas serotonin release was measured in RBL transfec- 
tants. Both transfectants showed a strong response following 
pFl and CI stimulation but not to purified mouse immuno- 
globulin or anti-CD8 antibody used as controls (Fig. 4 A and 
B). The structural integrity of the scTCR was further exam- 
ined with the superantigen SEB, which binds to both Vpll 



and MHC class II molecules f resulting in TCR crosslinking 
and T-cell activation regardless of the peptide bound to the 
MHC molecule (20, 29). ABC-f transfectants displayed a 
concentration-dependent response toward SEB (Fig. 4C) 
when the superantigen was presented by transformed B-cell 
lines with high-level expression of DR1 (DRA, DRB 1*0101; 
cell line LG2) or DR2 (DRA, DRBP1602; cell line 9016). 
Thus, the lateral face of the TCR V> region to which SEB is 
thought to bind (30) is structurally intact. 

To prove that the three-domain scTCR did indeed recog- 
nize the MHC/peptide ligand, antigen presentation experi- 
ments using the natural ligand for the Hy.2H9 clone, 9016 
cells bearing the DRB1*1602 allele of DR2, and MBP-(85-99) 
peptide were performed. To ensure detection of subtle ab- 
normalities in the structure of the three-domain scTCR, 9009 
cells (DRA, DRB 1*1601), which also bind MBP-(85-99), 
were used as a control. DRB1*1601 and DRB1*1602 differ 
only at position 67 in the DR^l domain; this TCR contact- 
residue substitution does, however, abolish recognition of 
the peptide by the parent T-cell clone (ref. 18; K.W.W., 
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Fig. 4. (A) [ 3 H]Serotonin release following TCR crosslinking of three-domain scTCR-f chimeric construct (ABC-0-transfected RBL cells. 
CI is specific to Vpl7 and £F1 is specific to the C region of TCR p chain, mlg, mouse immunoglobulin. (B) IL-2 release following TCR crosslinking 
of ABC-f-transfected BW~ cells. (C) [ 3 H]Serotonin release following SEB stimulation of ABC-C-transfected RBL ceUs. (D) MHC-restricted 
antigen-specific response from the ABC- f- transfected BW~ cells. (E) Dose-response curve of MHC/peptide recognition of ABC-^-transfected 
RBL cells. 
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unpublished work). The ABC-f-transfected BW" cells se- 
creted IL-2 in response to peptide-pulsed 9016 cells, but not 
to peptide-pulsed 9009 cells (Fig. 4D). Similar results were 
obtained with RBL transfectants (Fig. 4E), as serotonin 
release was dependent on the concentration of the MBP 
peptide used to pulse 9016 cells. The signal appeared to be 
weak when compared with antibody stimulation. This is not 
surprising, however, since saturating amounts of antibodies 
are expected to crosslink the majority of TCR molecules on 
the target cell, whereas a much smaller fraction of TCR 
molecules is probably engaged when T cells are cocultured 
with peptide-pulsed antigen-presenting cells, in which a max- 
imum of 5-15% of the DR molecules bind the peptide. The 
requirement for a high concentration of peptide or SEB is not 
due to the sc design, since high concentrations of peptide are 
also needed to stimulate afi heterodimers of the TCR-f 
constructs (8). It is likely that the decrease in sensitivity 
results from the lack of CD3, CD4, and/or other adhesion/ 
signaling molecules. Nonetheless, these results demonstrate 
that the three-domain scTCR was correctly folded and func- 
tionally competent. In addition, a soluble form of three- 
domain scTCR could be obtained from the ABC-f transfec- 
tants by thrombin cleavage and affinity purification (data not 
shown). 

DISCUSSION 

A scTCR molecule was designed which contains the V 
domains of both a and /3 chains and the C domain of the 0 
chain. This scTCR molecule could be stably expressed at a 
high level in eukaryotic cells and could be isolated in a soluble 
form by enzymatic cleavage and affinity chromatography. 
The V a and Vp domains appeared to be properly paired, since 
the scTCR bound to a conformation-dependent mAb, the 
superantigen SEB, and the proper MHC/peptide ligand. This 
design of scTCR offers an alternative to the two-chain design 
of soluble TCRs and has several advantages. (/) The sc design 
avoids the low-efficiency dimerization process which may be 
the limiting step in the assembly of TCR heterodimers from 
a and 0 subunits made in Escherichia coli. The sc design 
therefore allows efficient expression of the recombinant 
protein in quantities suitable for structural analysis and for 
some diagnostic or therapeutic applications. (//) The design 
avoids the problems associated with the unstable expression 
of the human TCR a chain that have hindered efficient 
expression of human TCR molecules in eukaryotic cells, (hi) 
The sc design may allow the construction of TCR phage 
display libraries similar to those made for sc antibodies (28, 
31). scTCR phage libraries may be powerful tools for the 
isolation of TCRs with defined specificities and/or high 
affinity for selective targeting of malignant and virally in- 
fected cells and for analyzing the interactions among TCR, 
MHC/peptide complexes, and superantigens. 

Unlike antibodies, separately expressed V domains of TCR 
a and j3 chains have not been reconstituted to form het- 
erodimers (ref. 32; K. L. Hilyard, personal communication). 
However, despite this success in producing the three-domain 
scTCR, a two-domain scTCR with detectable CI reactivity 
could not be produced either in eukaryotic cells or in bacte- 
ria. The presence of the Cp domain (or part of the domain) 
may be required for the proper folding and/or stabilization of 
a scTCR molecule. In any event, the three-domain design 
provides a general means for the efficient production of 
functional scTCR molecules. 
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encapsulating a genome encoding first and second polypep- 
tides of an antogenously assembling receptor, such as an 
antibody, and a receptor comprised of the first and second 
polypeptides surface-integrated into the matrix via a cpVTH 
membrane anchor domain fused to at least one of the 
polypeptides with a mutagenized CDR3 region. 

26 Claims, 12 Drawing Sheets 
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SHINE-DALGARNO MET 

G G C CG C A A ATT CTATTTC A AGG AGACAGT CAT A ATG 
CGTTTA AG ATA A AGTT C CT CTG T CAGT ATT A C 



LEADER SEQUENCE 



'AAATACCTATTGCCTACGGCAGCCGCT 
TTTATGGATAACGGATGCCGTCGGCGA 

LEADER SEQUENCE 

GGATTGTTATTACTCGCTGCCCAACCAG 1 
CCTAACAATAATGAGCGACGGGTTGGTC 

LINKER LINKER 



NCOI 



V H BACKBONE 



XHOI SPEI 



CCATGGCCCAGGTGAAACTG'CTCGAGATTTCTAGACTAGT 
GGTACCGGGTCCACTTTGACGAGCTCTAAAGATCTGATCA 

STOP LINKER 

TyrProTyrAspValProAspTyrAlaSer ; ^ 

TACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTCG 
ATGG G CATG CTGCA AG G CCTG ATG C CA AG A ATT AT CTTA AG CAG CT 
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H ETERODIMERIC RECEIPTOR LIBRARIES 
USING PHAGEMIDS 

CROSS-REFERENCE TO RELATED 
APPLICATION 

This application is a continuation of U.S. patent applica- 
tion Ser. No. 07/826.623. filed on Jan. 27. 1992. now 
abandoned, which is a continuation-in-part of U.S. patent 
application Ser. No. 07/683.602, filed Apr. 10. 1991. now 
abandoned, the disclosures of which are hereby incorporated 
by reference. 

This invention was made with government support under 
Grant No. CA 27489 awarded by the National Institutes of 
Health. 

TECHNICAL FIELD 

The present invention relates to cloning vectors and 
methods for producing a library of DNA molecules capable 
of expressing a fusion protein on the surface of a filamentous 
phage particle. 

BACKGROUND 

Filamentous bacteriophages are a group of related viruses 
that infect bacteria. They are termed filamentous because 
they are long and thin particles comprised of an elongated 
capsule that envelopes the deoxyribonucleic acid (DNA) 
that forms the bacteriophage genome. The F pili filamentous 
bacteriophage (Ff phage) infect only gram-negative bacteria 
by specifically adsorbing to the tip of F pili, and include fd. 
flandM13. 

The mature capsule of Ff phage is comprised of a coat of 
five phage-encoded gene products: cpVUL the major coat 
protein product of gene VHI that forms the bulk of the 
capsule; and four minor coat proteins, cpin and cpIV at one 
end of the capsule and cpVU and cpIX at the other end of the 
capsule. The length of the capsule is formed by 2500 to 3000 
copies of cpVTQ in an ordered helix array that forms the 
characteristic filament structure. About five copies each of 
the minor coat proteins are present at the ends of the capsule. 
The gene Hi-encoded protein (cpM) is typically present in 
4 to 6 copies at one end of the capsule and serves as the 
receptor for binding of the phage to its bacterial host in the 
initial phase of infection. For detailed reviews of Ff phage 
structure, see Rasched et aL. Microbiol Rev^ 50:401-427 
(1986); and Model et al.. in , The Bacteriophages, Volume 
2", R. Calendar, Ed.. Plenum Press, pp. 375-456 (1988). 

The assembly of a Ff phage particle involves highly 
complex mechanics. No phage particles are assembled 
within a host cell; rather, they are assembled during extru- 
sion of the viral genome through the host cell's membrane. 
Prior to extrusion, the major coat protein cpVUI and the 
minor coat protein cpm are synthesized and transported to 
the host cell's membrane. Both cpVUI and cpm are 
anchored in the host cell membrane prior to their incorpo- 
ration into the mature particle. In addition, the viral genome 
is produced and coated with cpV protein. During the extru- 
sion process. cpV-coated genomic DNA is stripped of the 
cpV coat and simultaneously recoated with the mature coat 
proteins. The assembly mechanisms that control transferral 
of these proteins from the membrane to the particle is not 
presently known. 

Both cpm and cpVTH proteins include two domains that 
provide signals for assembly of the mature phage particle. 
The first domain is a secretion signal that directs the newly 
synthesized protein to the host cell membrane. The secretion 
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signal is located at the amino terminus of the protein and 
targets the protein at least to the cell membrane. The second 
domain is a membrane anchor domain that provides signals 
for association with the host cell membrane and for asso- 

5 ciation with the phage particle during assembly. This second 
signal for both cpVUI and cpIH comprises at least a hydro- 
phobic region for spanning the membrane. 

cpVUE has been extensively studied as a model membrane 
protein because it can integrate into lipid bilayers such as the 

10 cell membrane in an asymmetric orientation with the acidic 
amino terminus toward the outside and the basic carboxy 
terminus toward the inside of the membrane. The mature 
protein is about 50 amino acid residues in length of which 
11 residues provide the carboxy terminus, 19 residues pro- 

15 vide the hydrophobic transmembrane region, and the 
remaining residues comprise the amino terminus. Consid- 
erable research has been done on the secretion signal region 
of cpVTH to advance the study of membrane protein syn- 
thesis and targeting to membranes. However, little is known 

20 about the changes that are tolerated in the structure of the 
cpVUI membrane anchor region that would allow for assem- 
bly of phage particles. 

Manipulation of the sequence of cpIH shows that the 
C- terminal 23 amino acid residue stretch of hydrophobic 

25 amino acids normally responsible for a membrane anchor 
function can be altered in a variety of ways and retain the 
capacity to associate with membranes. However, those 
anchor-modified cpni proteins lost their ability to geneti- 
cally complement gene HI mutants indicating that the 

30 requirements of a membrane anchor for functional assembly 
have not been elucidated. 

Ff phage-based expression vectors have been described in 
which the entire cpm amino acid residue sequence was 
modified by insertion of short polypeptide "epitopes" 
(Parmely et aL, Gene, 73:305-318 (1988); and Cwirla et al.. 
Proc. Natl Acad. ScL USA. 87:6378-6382 (1990)] or an 
amino acid residue sequence defining a single chain anti- 
body domain. McCafferty et al.. Science, 348:552-554 

^ (1990). These hybrid proteins were synthesized and 
assembled onto phage particles in amounts of about 5 copies 
per particle, a density at which normal cpin is usually found. 
However, these expressed fusion proteins include the entire 
cpm amino acid residue sequence and do not suggest fusion 

45 proteins that utilize only the carboxy terminal membrane 
anchor domain of cpm. 

In addition, no expression system has been described in 
which a phage coat protein has been engineered to allow 
assembly of a heteromeric molecule that is functional and 

50 capable of incorporation into the coat of a phage particle. 

BRIEF SUMMARY OF THE INVENTION 

A new. high density, surface-integration technology has 
been discovered for expressing a recombinant gene product 

55 on the surface of a filamentous phage containing the recom- 
binant gene. The invention uses a filamentous phage cpvm 
membrane anchor domain as a means for linking gene- 
product and gene during the assembly stage of filamentous 
phage replication. 

60 That is. during filamentous phage replication, gene VTH- 
encoded proteins assemble into a matrix which encapsulates 
the phage genome. It has now been discovered that (1) phage 
assembly is not disrupted when recombinant gene VTH- 
encoded proteins are present (2) recombinant gene Vffl- 

65 encoded proteins can be integrated into the assembling 
matrix, and (3) integration into the matrix can be directed to 
occur in a surface-accessible orientation. 
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The present invention can be advantageously applied to 
the production of heteromeric receptors of predetermined 
specificity, i.e., it can be used to produce antibodies. T-cell 
receptors and the like that bind a preselected ligand. 

Thus, the present invention provides for linking the func- 
tions of heteromeric receptor recognition and filamentous 
phage replication in a method for isolating a heteromeric 
receptor. The method produces a filamentous phage com- 
prised of a matrix of gene Vm-e needed proteins that encap- 
sulate a recombinant genome. The recombinant genome 
contains genes encoding the proteins of the heteromeric 
receptor proteins. The heteromeric receptor is surface- 
integrated into the encapsulating matrix via a gene VIII- 
encoded membrane anchor domain that is fused by a peptide 
bond during translation to one of the heteromeric receptor 
proteins. The heteromeric receptor and the genes which 
encode it are physically linked during the assembly stage of 
the phage replication cycle. Specifically binding the 
receptor-coated phage to a solid- support advantageously 
provides a means for isolating a recombinant genome that 
encodes a desired heteromeric receptor from a diverse 
library of recombinant genomes. 

In one embodiment, the present invention contemplates 
an antibody molecule comprising heavy- and light-chain 
proteins, said heavy-chain protein comprising a VV do main 
flanked by an ammo-terniinal prokaryotic secretion signal 
domain and a caxboxy-terminal filamentous phage cpVTH 
membrane anchor domain, said light chain protein compris- 
ing a V^-domain fused to an amino-terminal prokaryotic 
secretion signal domain. 

In another embodiment, the present invention contem- 
plates a vector for expressing a fusion protein, said vector 
comprising a cassette that includes upstream and down- 
stream translatable DNA sequences operatively linked via a 
sequence of nucleotides adapted for directional ligation of 
an insert DNA. said upstream sequence encoding a prokary- 
otic secretion signal, said downstream sequence encoding a 
filamentous phage gene cpVUI membrane anchor, said 
translatable DNA sequences operatively linked to a set of 
DNA expression signals for expression of said translatable 
DNA sequences as portions of said fusion protein. 

BRIEF DESCRIPTION OF THE DRAWINGS 
In the drawings forming a portion of this disclosure: 
FIG. 1 illustrates a schematic diagram of the immunoglo- 
bulin molecule showing the principal structural features. 
The circled area on the heavy chain represents the variable 
region (V w ), a polypeptide containing a biologically active 
(ligand binding) portion of that region, and a gene coding for 
that polypeptide, are produced by the methods of the present 
invention. 

FIG. 2A is a diagrammatic sketch of a heavy (H) chain of 
human IgG (IgGl subclass). Numbering is from the 
N-terminus on the left to the C-ternainus on the right Note 
the presence of four domains, each containing an intrachain 
disulfide bond (S — S) spanning approximately 60 amino 
acid residues. The symbol CHO stands for carbohydrate. 
The V region of the heavy (H) chain (V w ) resembles V c in 
having three hypervariablc CDR (not shown). 

FIG. 2B-1 is a diagrammatic sketch of a human light 
(Kappa) chain (Panel 1). Numbering is from the N-teiminus 
on the left to the C-terminus on the right Note the intrachain 
disulfide bond (S — S) spanning about the same number of 
amino acid residues in the V L and C L domains. 

FIG. 2B-2 shows the locations of the complementarity- 
deterrriining regions (CDR) in the V A domain. Segments 
outside the CDR arc the framework segments (FR). 
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FIG. 3 illustrates the sequence of the double-stranded 
synthetic DNA inserted into Lambda Zap to produce a 
Lambda Hc2 expression vector. The preparation of the 
double- stranded synthetic DNA insert is described in 

5 Example la(ii). The various features required for this vector 
to express the V^oding DNA homologs include the Shine- 
Dai gamo ribosome binding site, a leader sequence to direct 
the expressed protein to the periplasm as described by 
Mouva et ah. J. BioL Cherry 255:27. 1980, and various 

0 restriction enzyme sites used to operatively link the V w 
homologs to the expression vector. The V H expression 
vector sequence also contains a short nucleic acid sequence 
that codes for amino acids typically found in variable 
regions heavy chain (V w Backbone). This V„ Backbone is 

15 just upstream and in the proper reading as the V w DNA 
homologs that are operatively linked into the Xho I and Spe 
I cloning sites. The sequences of the top and bottom strands 
of the double-stranded synthetic DNA insert are listed 
respectively as SEQ. ID. NO. 1 and SEQ. ID. NO. 2. The 

m synthetic DNA insert is directionally ligated into Lambda 
Zap II digested with the restriction enzymes Not 1 and Xho 
I to form Lambda Hc2 expression vector. 

FIG. 4 illustrates the major features of the bacterial 
expression vector Lambda Hc2 (V H expression vector). The 
synthetic DNA sequence from FIG. 3 is shown at the top 
along with the T 3 polymerase promoter from Lambda Zap H 
The orientation of the insert in Lambda Zap II is shown. The 
V w DNA homologs are inserted into the Xho I and Spe I 
cloning sites. The read through transcription produces the 

3Q decapeptide epitope (tag) that is located just 3' of the cloning 
site. 

FIG. 5 illustrates the sequence of the double-stranded 
synthetic DNA inserted into Lambda Zap to produce a 
Lambda Lc2 expression vector. The various features 

35 required for this vector to express the V^-coding DNA 
homologs are described in FIG. 3. The V L -coding DNA 
homologs are operatively linked into the Lc2 sequence at the 
Sac I and Xho I restriction sites. The sequences of the top 
and bottom strands of the double-stranded synthetic DNA 

40 insert are listed respectively as SEQ. ID. NO. 3 and SEQ. ID. 
NO. 4. The synthetic DNA insert is directionally ligated into 
Lambda Zap II digested with the restriction enzymes Sac I 
and Not I to form Lambda Lc2 expression vector. 

FIG. 6 illustrates the major features of the bacterial 

45 expression vector Lc2 (V L expression vector). The synthetic 
DNA sequence from FIG. 5 is shown at the top along with 
the T 3 polymerase promoter from Lambda Zap n. The 
orientation of the insert in Lambda Zap II is shown. The 
DNA homologs are inserted into the Sac I and Xho I cloning 

50 sites. 

FIG. 7 illustrates the didstronic expression vector. 
pComb. in the form of a phagemid expression vector. To 
produce pComb. phagemids were first excised from the 
expression vectors. Lambda Hc2 and Lambda Lc2. using an 

55 in vivo excision protocol according to manufacturers 
instructions (Stratagene. La Jolla, Calif.). The pComb 
expression vector is prepared from Lambda Hc2 and 
Lambda Lc2 which do not contain V^coding or V^- coding 
DNA homologs. The in vivo excision protocol moved the 

60 cloned insert from the Lambda Hc2 and Lc2 vectors into a 
phagemid vector. The resultant phagemids contained the 
same nucleotide sequences for antibody fragment cloning 
and expression as did the parent vectors. Hc2 and Lc2 
phagemid expression vectors were separately restriction 

65 digested with Sea I and EcoR L The linearized phagemids 
were ligated via the Sea I and EcoR I cohesive termini to 
form the dicistronic (combinatorial) vector, pComb.- 
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FIG. 8 illustrates a schematic diagram of the composition 
of pCBAK8-2b phagemid vector, the pathway for Fab 
assembly and incorporation in phage coaL The vector carries 
the chloramphenicol acetyl transferase (CAT) marker gene 
in addition to the nucleotide residue sequences encoding the 5 
Fd-cpVm fusion protein and the kappa chain. The f 1 phage 
origin of replication facilitates the generation of single 
stranded phagemid. The isopropyl thiogalactopyranoside 
(TPTG) induced expression of a dicistronic message encod- 
ing the Fd-cpVUI fusion (V„. C Hl . cpVm) and the light 10 
chain (V L . Q) leads to the formation of heavy and light 
chains. Each chain is delivered to the periplasmic space by 
the pelB target sequence, which is subsequently cleaved. 
The heavy chain is anchored in the membrane by cpVIQ 
fusion while the light chain is secreted into the periplasm. 15 
The heavy chain in the presence of light chain assembles to 
form Fab molecules. The Fabs are incorporated into phage 
particles via cpVm (black dots). 

FIG. 9 illustrates the electron rmcrographic localization of 
5-7 nm colloidal gold particles coated with NPN-BSA 20 
conjugate along the surface of filamentous phage, and from 
phage emerging from a bacterial cell. Panel 9A shows 
filamentous phage emerging from the surface of the bacterial 
cell specifically labelled with the colloidal gold particles 
coated with BSA-NPN antigen. Panel 9B shows a portion of 25 
a mature filamentous phage on the length of which is 
exhibited the labelling of antigen binding sites. 

FIG. 10 illustrates the results of a two-site ELJSA for 
assaying for the presence and function of Fab antibody 
attached to the surface of bacteriophage particles as 
described in Example 4b. For expression of Fab antibody on 
phage surfaces. XLl-Blue cells were transformed with the 
phagemid expression vector, pCBAK8-2b. The inducer, 
isopropyl thiogalactopyranoside (IPTG). was admixed with 
the bacterial suspension at a final concentration of 1 mM for 
one hour. Helper phage was then admixed with the bacterial 
suspension to initiate the generation of copies of the sense 
strand of the phagemid DNA. After a two hour maintenance 
period, bacterial supematants containing bacteriophage par- ^ 
tides were collected for assaying in EUSA. 

Specific titra table binding of NFN-Fab-expressing bacte- 
riophage particles to NFN-coated plates was exhibited. No 
binding was detected with helper phage alone. 

FIG. 11 illustrates the inhibition of NPN-Fab expressing 45 
bacteriophage to NPN antigen-coated plates with the addi- 
tion of increasing amounts of free hapten. The assays were 
performed as described in FIG. 10. Complete inhibition of 
binding was observed with 5 ng of added free NPN hapten. 

FIG. 12 illustrates schematically the process of 50 
mutagenizing the CDR3 region of a heavy chain fragment 
resulting in an alteration of binding specificity. The oligo- 
nucleotide primers are indicated by black bars. The process 
is described in Example 6. 

55 

DETAILED DESCRIPTION OF THE 
INVENTION 

A. Definitions 

Amino Acid Residue: An amino acid formed upon chemi- 60 
cal digestion (hydrolysis) of a polypeptide at its peptide 
linkages. The amino acid residues described herein are 
preferably in the "L" isomeric form. However, residues in 
the "D" isomeric form can be substituted for any L-amino 
acid residue, as long as the desired functional property is 65 
retained by the polypeptide. NH 2 refers to the free amino 
group present at the amino terminus of a polypeptide. 
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COOH refers to the free carboxy group present at the 
carboxy terminus of a polypeptide. In keeping with standard 
polypeptide nomenclature (described in J. BioL Ckem^ 
2433552-59 (1969) and adopted at 37 C.FR. 1.822(bX2)). 
abbreviations for amino acid residues are shown in the 
following Table of Correspondence: 

TABLE OF CORRESPONDENCE 



SYMBOL 



1 -Letter 


3-Lcttcr 


AMINO ACID 


Y 


TVr 


tyrosine 


G 


Gly 


glycine 


F 


Fbe 


phenylalanine 


M 


Met 


methionine 


A 


Ala 


alanine 


S 


Scr 


serine 


I 


He 


isoleucme 


L 


Leu 


leucine 


T 


Thr 




V 


va 


Y&line 


P 


Pro 


proline 


K 


Lys 


lysine 


H 


His 


histidine 


Q 


Gin 


glu taurine 


£ 


Glu 


glutamk acki 


Z 


Gtx 


Ght and/or Gin 


W 


Tip 


tryptophan 


R 


Arg 




D 


Asp 


aspartic acid 


N 


Asa 


asparagioe 


B 


Asx 


Asn and/or Asp 


C 


Cys 


cysteine 


J 


Xaa 


Unknown or other 



It should be noted that all amino acid residue sequences 
represented herein by formulae have a left-to-right orienta- 
tion in the conventional direction of amino terminus to 
carboxy terminus. In addition, the phrase "amino acid resi- 
due** is broadly defined to include the amino acids listed in 
the Table of Correspondence and modified and unusual 
amino acids, such as those listed in 37 C.F.R. 1.822(b)(4), 
and incorporated herein by reference. Furthermore, it should 
be noted mat a dash at the beginning or end of an amino acid 
residue sequence indicates a peptide bond to a further 
sequence of one or more amino acid residues or a covalent 
bond to an aniino- terminal group such as NH 2 01 acetyl or 
to a carboxy^erminal group such as COOH. 

Nucleotide: A monomelic unit of DNA ox RNA consisting 
of a sugar moiety (pentose), a phosphate, and a nitrogenous 
heterocyclic base. The base is linked to the sugar moiety via 
the glycosidic carbon (1' carbon of the pentose) and that 
combination of base and sugar is a nucleoside. When the 
nucleoside contains a phosphate group bonded to the 3' or 5* 
position of the pentose it is referred to as a nucleotide. A 
sequence of operatively linked nucleotides is typically 
referred to herein as a "base sequence** or "nucleotide 
sequence", and their grammatical equivalents, and is repre- 
sented herein by a formula whose left to right orientation is 
in the conventional direction of 5'- terminus to 3- terminus. 

Base Pair (bp): A partnership of adenine (A) with thymine 
(T), or of cytosine (C) with guanine (G) in a double stranded 
DNA molecule. In RNA. uracil (U) is substituted for thym- 
ine. 

Nucleic Acid: A polymer of nucleotides, either single or 
double stranded. 

Polynucleotide: a polymer of single or double stranded 
nucleotides. As used herein "polynucleotide" and its gram- 
matical equivalents, will include the full range of nucleic 
acids. A polynucleotide will typically refer to a nucleic add 
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molecule comprised of a linear strand of two or more 
deoxyribonucleotides and/or ribonucleotides. The exact size 
will depend on many factors, which in turn depends on the 
ultimate conditions of use, as is well known in the art The 
polynucleotides of the present invention include primers. 5 
probes. RNA/DNA segments, oligonucleotides or "oligos" 
(relatively short polynucleotides), genes, vectors, plasmids. 
and the like. 

Gene: A nucleic acid whose nucleotide sequence codes for 
an RNA or polypeptide. A gene can be either RNA or DNA. 10 

Duplex DNA: a double-stranded nucleic acid molecule 
comprising two strands of substantially complementary 
polynucleotides held together by one or more hydrogen 
bonds between each of the complementary bases present in 
a base pair of the duplex. Because the nucleotides that form 
a base pair can be either a ribonucleotide base or a deox- 
yribonucleotide base, the phrase "duplex DNA** refers to 
either a DNA-DNA duplex comprising two DNA strands (ds 
DNA). or an RNA-DNA duplex comprising one DNA and ^ 
one RNA strand. 

Complementary Bases: Nucleotides that normally pair up 
when DNA or RNA adopts a double stranded configuration. 

Complementary Nucleotide Sequence: A sequence of 
nucleotides in a single-stranded molecule of DNA or RNA 23 
mat is sufficiently complementary to that on another single 
strand to specifically hybridize to it with consequent hydro- 
gen bonding. 

Conserved: A nucleotide sequence is conserved with 
respect to a preselected (reference) sequence if it non- 30 
randomly hybridizes to an exact complement of the prese- 
lected sequence. 

Hybridization: The pairing of substantially complemen- 
tary nucleotide sequences (strands of nucleic acid) to form 
a duplex or heteroduplex by the establishment of hydrogen 35 
bonds between complementary base pairs. It is a specific. Le. 
non-random, interaction between two complementary poly- 
nucleotides that can be competitively inhibited. 

Nucleotide Analog: A purine or pyrimidine nucleotide 
that differs structurally from A. T. G. C. or U. but is 40 
sufficiently similar to substitute for the normal nucleotide in 
a nucleic acid molecule. 

DNA Homolog: Is a nucleic acid having a preselected 
conserved nucleotide sequence and a sequence coding for a ^ 
receptor capable of binding a preselected ligand. 

Recombinant DNA (rDNA) molecule: a DNA molecule 
produced by operatively linking two DNA segments. Thus, 
a recombinant DNA molecule is a hybrid DNA molecule 
comprising at least two nucleotide sequences not normally ^ 
found together in nature. rDNA's not having a common 
biological origin, i.e.. evolutionarily different, are said to be 
'•heterologous*. 

Vector: a rDNA molecule capable of autonomous repli- 
cation in a cell and to which a DNA segment, e.g.. gene or 55 
polynucleotide, can be operatively linked so as to bring 
about replication of the attached segment Vectors capable of 
directing the expression of genes encoding for one or more 
proteins are referred to herein as "expression vectors". 
Particularly important vectors allow cloning of cDNA ^ 
(complementary DNA) from mRNAs produced using 
reverse transcriptase. 

Receptor A receptor is a molecule, such as a protein, 
glycoprotein and the like, that can specifically (non- 
randomly) bind to another molecule. 65 

Antibody: The term antibody in its various grammatical 
forms is used herein to refer to immunoglobulin molecules 
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and immunologically active portions of immunoglobulin 
molecules. Le.. molecules that contain an antibody combin- 
ing site or paratope. Exemplary antibody molecules are 
intact immunoglobulin molecules, substantially intact 
immunoglobulin molecules and portions of an immunoglo- 
bulin molecule, including those portions known in the art as 
Fab. Fab'. F(ab') 2 and F(v). 

Antibody Combining Site: An antibody combining site is 
that structural portion of an antibody molecule comprised of 
a heavy and light chain variable and hypervariable regions 
that specifically binds (immunoreacts with) an antigen. The 
term immunoreact in its various forms means specific bind- 
ing between an antigenic determinant-containing molecule 
and a molecule containing an antibody combining site such 
as a whole antibody molecule or a portion thereof. 

Monoclonal Antibody: The phrase monoclonal antibody 
in its various grammatical forms refers to a population of 
antibody molecules that contains only one species of anti- 
body combining site capable of immunoreacting with a 
particular antigen. A monoclonal antibody thus typically 
displays a single binding affinity for any antigen with which 
it immunoreacts. A monoclonal antibody may therefore 
contain an antibody molecule having a plurality of antibody 
combining sites, each immunospecific for a different 
antigen, e.g.. a bispecific monoclonal antibody. 

Fusion Protein: A protein comprised of at least two 
polypeptides and a linking sequence to operatively link the 
two polypeptides into one continuous polypeptide. The two 
polypeptides linked in a fusion protein are typically derived 
from two independent sources, and therefore a fusion pro- 
tein comprises two linked polypeptides not normally found 
linked in nature. 

Upstream: In the direction opposite to the direction of 
DNA transcription, and therefore going from 5' to 3' on the 
non-coding strand, or 3' to 5* on the mRNA. 

Downstream: Further along a DNA sequence in the direc- 
tion of sequence transcription or read out that is traveling in 
a 3*- to S'-direction along the non-coding strand of the DNA 
or 5'- to 3-direction along the RNA transcript. 

Cistron: Sequence of nucleotides in a DNA molecule 
coding for an amino acid residue sequence and including 
upstream and downstream DNA expression control ele- 
ments. 

Stop Codon: Any of three codons that do not code for an 
amino acid, but instead cause termination of protein syn- 
thesis. They are UAG, UAA and UGA and are also referred 
to as a nonsense or termination codon. 

Leader Polypeptide: A short length of amino acid 
sequence at the amino end of a protein, which carries or 
directs the protein through the inner membrane and so 
ensures its eventual secretion into the periplasmic space and 
perhaps beyond. The leader sequence peptide is commonly 
removed before the protein becomes active. 

Reading Frame: Particular sequence of contiguous nucle- 
otide triplets (codons) employed in translation. The reading 
frame depends on the location of the translation initiation 
codon. 

B. Filamentous Phage 

The present invention contemplates a filamentous phage 
comprising a matrix of cpvm proteins encapsulating a 
genome encoding first and second polypeptides. The phage 
further comprises a heteromeric receptor comprised of the 
first and second polypeptides surface-integrated into the 
matrix via a cpVTH membrane anchor domain fused to at 
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least one of the first or second polypeptides. Preferably, the 
first and second polypeptides are V„ and proteins, 
respectively. 

The first and second polypeptides are capable of autog- 
enous assembly into a functional receptor, which is 
expressed on the outer surface in a manner accessible to 
ligand. i.e. they are surface-integrated into the phage. 
Typically, the receptor is comprised of a linking polypeptide 
that contains the cpvm membrane anchor domain, such as 
a polypeptide described in Section C. and a non-linking 
polypeptides). 

Because the receptor is linked to the phage in a surface 
accessible manner, the phage can be advantageously used as 
a solid-phase affinity sorbenL In preferred embodiments, the 
phage are linked, preferably removably linked, to a solid 
(aqueous insoluble) matrix such as agarose, cellulose, syn- 
thetic reins, polysaccharides and the like. For example, 
transformants shedding the phage can be applied to and 
retained in a column and maintained under conditions that 
support shedding of the phage. An aqueous composition 
containing a ligand that binds to the receptor expressed by 
the phage is then passed through the column at a predeter- 
mined rate and under receptor-binding conditions to form a 
solid-phase receptor-ligand complex. The column is then 
washed to remove unbound material, leaving the ligand 
bound to the solid-phase phage. The ligand can then be 
removed and recovered by washing the column with a buffer 
that promotes dissociation of the receptor-ligand complex. 

Alternatively, purified phage can be admixed with a 
aqueous solution containing the ligand to be affinity purified. 
The receptor/ligand binding reaction admixture thus formed 
is maintained for a time period and under binding conditions 
sufficient for a phage-linked receptor-ligand complex to 
form. The phage-bound ligand (ligand-bearing phage) are 
then separated and recovered from the unbound materials, 
such as by centrifugation. electrophoresis, precipitation, and 
the like. 

C. DNA Expression Vectors 

A vector of the present invention is a recombinant DNA 
(rDNA) molecule adapted for receiving and expressing 
translatable DNA sequences in the form of a fusion protein 
containing a filamentous phage gene VTQ membrane anchor 
domain and a prokaryotic secretion signal domain. The 
vector comprises a cassette that includes upstream and 
downstream translatable DNA sequences operatively linked 
via a sequence of nucleotides adapted for directional liga- 
tion. The upstream translatable sequence encodes the secre- 
tion signal. The downstream translatable sequence encodes 
the filamentous phage membrane anchor. The cassette pref- 
erably includes DNA expression control sequences for 
expressing the fusion protein that is produced when a 
translatable DNA sequence is directionally inserted into the 
cassette via the sequence of nucleotides adapted for direc- 
tional ligation. 

An expression vector is characterized as being capable of 
expressing, in a compatible host, a structural gene product 
such as a fusion protein of the present invention. 

As used herein, the term 'Vector" refers to a nucleic acid 
molecule capable of transporting between different genetic 
environments another nucleic acid to which it has been 
operatively linked. Preferred vectors are those capable of 
autonomous replication and expression of structural gene 
products present in the DNA segments to which they are 
operatively linked. 

As used herein with regard to DNA sequences or 
segments, the phrase "operatively linked** means the 
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sequences or segments have been covalently joined into one 
shard of DNA. whether in single or double stranded form. 

The choice of vector to which a cassette of this invention 
is operatively linked depends directly, as is well known in 
5 the art. on the functional properties desired, e.g.. vector 
replication and protein expression, and the host cell to be 
transformed, these being limitations inherent in the art of 
constructing recombinant DNA molecules. 
In preferred embodiments, the vector utilized includes a 

10 prokaryotic replicon Le.. a DNA sequence having the ability 
to direct autonomous replication and maintenance of the 
recombinant DNA molecule extra chromosomally in a 
prokaryotic host cell, such as a bacterial host cell, trans- 
formed therewith. Such replicons are well known in the art 

15 In addition, those embodiments that include a prokaryotic 
replicon also include a gene whose expression confers a 
selective advantage, such as drug resistance, to a bacterial 
host transformed therewith. Typical bacterial drug resistance 
genes are those that confer resistance to ampiciilin or 

20 tetracycline. Vectors typically also contain convenient 
restriction sites for insertion of translatable DNA sequences. 
Exemplary vectors are the plasmids pUC8. pUC9, pBR322. 
and pBR329 available from BioRad Laboratories. 
(Richmond. Calif.) and pPL and pKK223 available from 

25 Pharmacia. (Piscataway. N.J.). 

A sequence of nucleotides adapted for directional ligation, 
i.e.. a polylinker. is a region of the DNA expression vector 
that (1) operatively links for replication and transport the 
upstream and downstream translatable DNA sequences and 

30 (2) provides a site or means for directional ligation of a DNA 
sequence into the vector. Typically, a directional polylinker 
is a sequence of nucleotides that defines two or more 
restriction endonuclease recognition sequences, or restric- 
tion sites. Upon restriction cleavage, the two sites yield 

35 cohesive terrnini to which a translatable DNA sequence can 
be ligated to the DNA expression vector. Preferably, the two 
restriction sites provide, upon restriction cleavage, cohesive 
termini that are non-complementary and thereby permit 
directional insertion of a translatable DNA sequence into the 

40 cassette. In one embodiment, the directional ligation means 
is provided by nucleotides present in the upstream translat- 
able DNA sequence, downstream translatable DNA 
sequence, or both. In another embodiment the sequence of 
nucleotides adapted for directional ligation comprises a 

45 sequence of nucleotides that defines multiple directional 
cloning means. Where the sequence of nucleotides adapted 
for directional ligation defines numerous restriction sites, it 
is referred to as a multiple cloning site. 

^ A translatable DNA sequence is a linear series of nucle- 
otides that provide an uninterrupted series of at least 8 
codons that encode a polypeptide in one reading frame. 

An upstream translatable DNA sequence encodes a 
prokaryotic secretion signal. The secretion signal is a leader 

55 peptide domain of protein that targets the protein to the 
periplasmic membrane of gram negative bacteria. 

A preferred secretion signal is a pelB secretion signal. The 
predicted amino acid residue sequences of the secretion 
signal domain from two pelB gene product variants from 

60 Erwinia carotova are shown in Table 1 as described by LcL 
et aL. Nature, 331:543-546 (1988). A particularly preferred 
pelB secretion signal is also shown in Table 1. 

The leader sequence of the pelB protein has previously 
been used as a secretion signal for fusion proteins. Better et 

65 at. Science, 240:1041-1043 (1988); Sastry et al.^mc. NotL 
Acad ScL USA 86:5728-5732 (1989); and MuUinax et aL. 
Prvc. NatL Acaa\ Set. USA 87:8095-8099 (1990). 
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Amino acid residue sequences for other secretion signal 
polypeptide domains from£. coli useful in this invention are 
also listed in Table 1. Oliver. In Neidhard. F. C. (ed.). 
Escherichia colt and Salmonella Typhimurium. American 
Society for Microbiology. Washington. D.C., 1:56-69 
(1987). 

A translatable DNA sequence encoding the pelB secretion 
signal having the amino acid residue sequence shown in 
SEQ. ID. NO. 5 is a preferred DNA sequence for inclusion 
in a DNA expression vector of 

TABLE 1 



Leacfcr Sequences 



SEQ. 

ID. NO. lypc Ammo Acid Residue Sequence 

(5) PelB 1 MeCysIVtUttUuProThrAlaAiaAlsGIyl^uLeu 

LcuLeuAlaAlaGlnProAJaMct 

(6) PelB 3 MetLysTVtlx^uf^lluAJaAIaAJBGIyLcuLeu 

IxulxuAlaAlaGhiProAlaOlnPioAlaMetA^ 

(7) PeJB J MctLy^dxuTkThiProIkAlaAlaGlyLcuLeu 

LcuAiaPbeSciOtn'ryrSeiLcaAla 

(8) MaJE 4 MetLys^ysThiGtyAlaArgrjdL^^ 

Alal^ThrTfarMctMetPbeSerAIaSerAlaLeuAia 
LysBe 

(9) OmpF* MetMetLysArgAsnDeLeuAJaValDeValPioAla 

UulxuVilAlaGI)ThrAJaA5nAiaAJaGlu 

(10) PboA 4 MeU^^lnSerT^lkAlaLeuAJalxuLeuPirf^ 

LeuPheThiPro VaJThrLy sAlaArgThr 

(11) Bl3* KfctSerIV<}liiHisP^A^ValAhLeuneProPhe 

PteAliAlaPbeCysLcuProValPhcAJaHisPro 

(12) LamB 4 MetMetDeThH>uAxgLysUiiP^ * 

VklAlaAlXjlyV^etSciAl^biAlaMetAlttVia 
Asp 

( 13) Lpp 4 MctLysAlaThiLysLeuViaUuGly^ 

GtySexUnLeuLeuAlaGIyCysSer 

(14) cpVnr 5 X^ysLysSciLeuVafl^uLysAJaSerValAlaVal 

AtelbrLeuValPiDMelLcuSefPheAla 

(15) cpm 6 MctLy^yd^euUniPheAl^Prol^VaTValPro 

PheTyrSerlGsSer 

1 pdB used in this invcuiiaa 

3 pelB from Erwinia carvtavora gene 

3 pdB from Erwinia enrotovora EC 16 gene 

4 k2der sequences from £ cod 

'leader sequence for cpVTJJ 

fl Ieeder sequence for cpIH 

this invention. 

A downstream translatable DNA sequence encodes a 
filamentous phage membrane anchor. Preferred membrane 
anchors are obtainable from filamentous phage M13. f 1, f(L 
and the like. Preferred membrane anchor domains are found 
in the coat proteins encoded by gene m and gene VHL Thus, 
a downstream translatable DNA sequence encodes an amino 
acid residue sequence that corresponds, and preferably is 
identical, to the membrane anchor domain of either a fila- 
mentous phage gene HI or gene VHI coat protein. 

The membrane anchor domain of a filamentous phage 
coat protein is the carboxy terminal region of the coat 
protein and includes a region of hydrophobic amino acid 
residues for spanning a lipid bilayer membrane, and a region 
of charged amino acid residues normally found at the 
cytoplasmic face of the membrane and extending away from 
the membrane. Ia the phage fl, gene VDI coat protein's 
membrane spanning region comprises residue Trp-26 
through Lys4o. and the cytoplasmic region comprises the 
carboxy- terminal 11 residues from 41 to 52. Ohkawa et aL, 
J. BioL Chem., 256:995 1-9958 (1981). 

The amino acid residue sequence of a preferred mem- 
brane anchor domain derived from the M13 filamentous 
phage gene IU coat protein (also designated cpflT) has a 
sequence shown in SEQ. ID. NO. 16 from residue 1 to 
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residue 211. Gene III coat protein is present on a mature 
filamentous phage at one end of the phage particle with 
typically about 4 to 6 copies of the coat protein. 
The amino acid residue sequence of a preferred mem- 

5 brace anchor domain derived from the M13 filamentous 
phage gene Vm coat protein (also designated cpVTH) has a 
sequence shown in SEQ. ID. NO. 17 from residue 1 to 
residue 50. Gene VID coat protein is present on a mature 
filamentous phage over the majority of the phage particle 
with typically about 2500 to 3000 copies of the coat protein. 

For detailed descriptions of the structure of filamentous 
phage particles, their coat proteins and particle assembly, see 
the reviews by Rached et al.. Microbiol Rev^, 50:401-427 
(1986); and Model et al. in 'The Bacteriophages: Vol. 2". R. 
Calendar, ed. Plenum Publishing Co.. pp. 375-456. (1988). 

15 A cassette in a DNA expression vector of this invention is 
the region of the vector that forms, upon insertion of a 
translatable DNA sequence, a sequence of nucleotides 
capable of expressing, in an appropriate host a fusion 
protein of this invention. The expression-competent 

M sequence of nucleotides is referred to as a cistron. Thus, the 
cassette comprises DNA expression control elements opera- 
lively linked to the upstream and downstream translatable 
DNA sequences. A cistron is formed when a translatable 
DNA sequence is directionally inserted (directionally 

25 ligated) between the upstream and downstream sequences 
via the sequence of nucleotides adapted for that purpose. 
The resulting three translatable DNA sequences, namely the 
upstream, the inserted and the downstream sequences, are all 
opcratively linked in the same reading frame. 

DNA expression control sequences comprise a set of 
DNA expression signals for expressing a structural gene 
product and include both 5* and 3* elements, as is well 
known, operatively linked to the cistron such that the cistron 
is able to express a structural gene product. The 5' control 
sequences define a promoter for initiating transcription and 

35 a ribosome binding site operatively linked at the 5' terminus 
of the upstream translatable DNA sequence. 

To achieve high levels of gene expression in £ colt, it is 
necessary to use not only strong promoters to generate large 
quantities of mRNA. but also ribosome binding sites to 

40 ensure that the mRNA is efficiently translated. In E. coli. the 
ribosome binding site includes an initiation codon (AUG) 
and a sequence 3-9 nucleotides long located 3-11 nucle- 
otides upstream from the initiation codon [Shine et al.. 
Nature, 25434 (1975)] The sequence. AGGAGGU, which is 

45 called the Shine- Dalgarno (SD) sequence, is complementary 
to the 3' end of £L coll 16S mRNA. Binding of the ribosome 
to mRNA and the sequence at the 3' end of the mRNA can 
be affected by several factors: 

(i) The degree of complementarity between the SD 
50 sequence and 3' end of the 16S tRNA. 

(ii) The spacing and possibly the DNA sequence lying ' 
between the SD sequence and the AUG [Roberts et al.. 
Proc. NatL Acad. ScL USA. 76:760 (1979a); Roberts et 
al.. Prvc. Natl. Acad ScL USA, 76:5596 (1979b); 

55 Guarente et al.. Science, 209:1428 (1980); and 
Guarente et aL. Cell, 20:543 (1980).] Optimization is 
achieved by measuring the level of expression of genes 
in plasmids in which this spacing is systematically 
altered. Comparison of different mRNAs shows that 

60 there are statistically preferred sequences from posi- 
tions -20 to +13 (where the A of the AUG is position 
0) [Gold et aL Anna. Rev, Microbiol, 35:365 (1981)]. 
Leader sequences have been shown to influence trans- 
lation dramatically (Roberts et aL. 1979 a. b supra). 

65 (iii) The nucleotide sequence following the AUG. which 
affects ribosome binding [Taniguchi et aL. J. Mol Biol^ 
118:533 (1978)). 
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Useful ribosomc binding sites are shown in Table 2 below. 



TABLE 2 







Ribosome Binding Sites* 




SEQ. 






ID. NO. 




1. 


(18) 


5* AAUCWGGAGGCUUIJUU1JAUCX3UUCGUUCU 


2. 


(19) 


5* UAACUAAGGAUGAAAUGCATTG1JCUAAGACA 


3. 


(20) 


5' UCCUAGGAGGUUUGACCUATJCCGAGCUUUU 


4. 


(21) 


5' AUGU AC U AAGG AGGUUGUATKX3 AAC AACGC 







* Sequences of initiation regions for protein synthesis in four phage mRNA 

molecules are under lined. 

AUG = initiation codon (double underlined) 

1. = Phage $X174 gene-A protein 

2. = Phage Q0 repiicase 

3. = Phage R17 gene-A protein 

4. - Phage lambda gene-cro protein 

The 3* control sequences define at least one termination 
(stop) codon in frame with and operativeJy linked to the 
downstream translatable DNA sequence. 

Thus, a DNA expression vector of this invention provides 
a system for cloning translatable DNA sequences into the 
cassette portion of the vector to produce a cistron capable of 
expressing a fusion protein of this invention. 

In preferred embodiments, a DNA expression vector 
provides a system for independently cloning two translatable 
DNA sequences into two separate cassettes present in the 
vector, to form two separate cistrons for expressing both 
polypeptides of a heterodimeric receptor, or the ligand 
binding portions of the polypeptides that comprise a het- 
erodimeric receptor. The DNA expression vector for 
expressing two cistrons is referred to as a dicistronic expres- 
sion vector. 

Thus, a preferred DNA expression vector of this invention 
comprises, in addition to the cassette previously described in 
detail, a second cassette for expressing a second fusion 
protein. The second cassette includes a third translatable 
DNA sequence that encodes a secretion signal, as defined 
herein before, operativdy linked at its 3* terminus via a 
sequence of nucleotides adapted for directional ligation to a 
downstream DNA sequence defining at least one stop codon. 
The third translatable DNA sequence is operatively linked at 
its 5* terminus to DNA expression control sequences forming 
the 5* elements defined above. The second cassette is 
capable, upon insertion of a translatable DNA sequence, of 
expressing the second fusion protein. 

In a preferred embodiment, a DNA expression vector is 
designed for convenient manipulation in the form of a 
filamentous phage particle according to the teachings of the 
present invention. In this embodiment a DNA expression 
vector further contains a nucleotide sequence that defines a 
filamentous phage origin of replication such that the vector, 
upon presentation of the appropriate genetic 
complementation, can replicate as a filamentous phage in 
single stranded replicative form and be packaged into fila- 
mentous phage particles. This feature provides the ability of 
the DNA expression vector to be packaged into phage 
particles for subsequent segregation of the particle, and 
vector contained therein, away from other particles that 
comprise a population of phage particles. 

A filamentous phage origin of replication is a region of the 
phage genome, as is well known, that defines sites for 
initiation of replication, termination of replication and pack- 
aging of the replicative form produced by replication. See, 
for example, Rasched ct at, Microbiol Rev., 50:401-427 
(1986); and Horiuchi. J. Mol Biol. 188215-223 (1986). 

A preferred filamentous phage origin of replication for use 
in the present invention is a M 13. f 1 or fd phage origin of 
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replication. Particularly preferred is a filamentous phage 
origin of replication having a sequence shown in SEQ. DO. 
NO. 117 and described by Short et al.. NucL Acids Res., 
16:7583-7600 (1988). Preferred DNA expression vectors 
5 are the dicistronic expression vectors pCOMB8 and 
pCOMB3 described in Example lb(i) and lb(ii). respec- 
tively. 

D. Polypeptides 

10 In another embodiment, the present invention contem- 
plates a polypeptide comprising an insert domain flanked by 
an ammo-terminal secretion signal domain and a carboxy- 
terminal filamentous phage coat protein membrane anchor 
domain. 

15 Preferably, the polypeptide is a fusion protein having a 
receptor domain comprised of an amino acid residue 
sequence that defines the ligand binding domain of a recep- 
tor protein positioned between a prokaryotic secretion signal 
domain and a gene VTLI -encoded (cpVDI) membrane anchor 

20 domain. In preferred embodiments, the receptor protein is a 
polypeptide chain of a heterodimeric receptor. Insofar as the 
polypeptide has a receptor domain, it is also referred to 
herein as a receptor. In other preferred embodiments the 
secretion signal domain is a pelB secretion signal as 

25 described herein. 

Preferred heterodimeric receptors include 
immunoglobulins, major histocompatibility antigens of 
class I or IL lymphocyte receptors, integrins and the like 

jq heterodimeric receptors. 

In one embodiment, a polypeptide of this invention has an 
amino acid residue sequence that can be represented by the 
formula, shown in the direction of amino- to carboxy 
terminus: 

33 NH 2 — O — (XJ]^ — V — (XX — Z— COOH, (Fl) 

where O represents an amino acid residue sequence defining 
a secretion signal, U represents a first spacer polypeptide. V 
represents an amino acid residue sequence defining a recep- 
40 tor domain. X represents a second spacer polypeptide, and Z 
represents an amino acid residue sequence defining a fila- 
mentous phage coat protein membrane anchor, with the 
proviso that ra is the integer 0 or 1 such that when m is O. 
U is not present, and when m is 1, U is present and n is 0 
45 or 1 such that when n is O, X is not present and when n is 
1, X is present. 

In the formula (Fl), the secretion signal and the filamen- 
tous phage coat protein membrane anchor are as defined 
herein above. Particularly preferred is a polypeptide accord- 
so ing to formula (Fl) where Z defines the gene VTTJ membrane 
anchor as described herein. In another preferred embodi- 
ment the secretion signal is the pelB secretion signal. 

In one ernbodiment. V is an amino acid residue sequence 
that defines the ligand binding domain of a chain of a 
55 heterodimeric receptor molecule, and preferably is an immu- 
noglobulin variable region polypeptide. In a particularly 
preferred polypeptide V is a W H ox V L polypeptide. Most 
preferred is a polypeptide where V is an iinmunoglobulin V w 
polypeptide, and m and n are both zero. 
60 In another embodiment, U or X can define a proteolytic 
cleavage site, such as the sequence of amino acids found in 
a precursor protein, such as prothrombin, factor X and the 
like, that defines the site of cleavage of the protein. A fusion 
protein having a cleavage site provides a means to purify the 
65 protein away from the phage particle to which it is attached. 
The polypeptide spacers U and X can each have any 
sequence of amino acid residues of from about 1 to 6 amino 
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acid residues in length. Typically the spacer residues are 
present in a polypeptide to accommodate the continuous 
reading frame that is required when a polypeptide is pro- 
duced by the methods disclosed herein using a DNA expres- 
sion vector of this invention. 

A receptor of the present invention assumes a conforma- 
tion having a binding site specific for. as evidenced by its 
ability to be competitively inhibited, a preselected or pre- 
determined ligand such as an antigen, enzymatic substrate 
and the like. In one embodiment, a receptor of this invention 
is a ligand binding polypeptide that forms an antigen binding 
site which specifically binds to a preselected antigen to form 
a complex having a sufficiently strong binding between the 
antigen and the binding site for the complex to be isolated. 
When the receptor is an antigen binding polypeptide its 
affinity or avidity is generally greater than 10 s M" 1 more 
usually greater than 10 6 and preferably greater than \(f M~ l . 

In another embodiment, a receptor of the subject inven- 
tion binds a substrate and catalyzes the formation of a 
product from the substrate. While the topology of the ligand 
binding site of a catalytic receptor is probably more impor- 
tant for its preselected activity than its affinity (association 
constant or pKa) for the substrate, the subject catalytic 
receptors have an association constant for the preselected 
substrate generally greater than 10 3 M~\ more usually 
greater than 10 5 M~ l or 10 6 M" 1 and preferably greater than 
10 7 M" 1 . 

Preferably the receptor produced by the subject invention 
is heterodimeric and is therefore normally comprised of two 
different polypeptide chains, which together assume a con- 
formation having a binding affinity, or association constant 
for the preselected ligand that is different preferably higher, 
than the affinity or association constant of either of the 
polypeptides alone, i.e.. as monomers. One or both of the 
different polypeptide chains is derived from the variable 
region of the Light and heavy chains of an immunoglobulin. 
Typically, polypeptides comprising the light (VJ and heavy 
(V w ) variable regions are employed together for binding the 
preselected ligand. 

A receptor produced by the subject invention can be 
active in monomelic as well as miiltimeric forms, either 
homomcric or heteromeric. preferably heterodimeric. For 
example. V H and V L ligand binding polypeptide produced 
by the present invention can be advantageously combined in 
the heterodimer to modulate the activity of either or to 
produce an activity unique to the heterodimer. 

The individual ligand polypeptides will be referred to as 
V w and and the heterodimer will be referred to as a Fv. 
However, it should be understood that a V H may contain in 
addition to the V w , substantially all or a portion of the heavy 
chain constant region. Similarly, a V L may contain, in 
addition to the V L . substantially all or a portion of the light 
chain constant region. A heterodimer comprised of a W H 
containing a portion of the heavy chain constant region and 
a Vl containing substantially all of the light chain constant 
region is termed a Fab fragment. The production of Fab can 
be advantageous in some situations because the additional 
constant region sequences contained in a Fab as compared to 
a Fv can stabilize the V H and \ L interaction. Such stabili- 
zation can cause the Fab to have higher affinity for antigen. 
In addition the Fab is more commonly used in the art and 
thus there are more commercial antibodies available to 
specifically recognize a Fab in screening procedures. 

The individual V H and V L polypeptides can be produced 
in lengths equal to or substantially equal to their naturally 
occurring lengths. However, in preferred embodiments, the 
V H and V L polypeptides wfll generally have fewer than 125 
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amino acid residues, more usually fewer than about 120 
amino acid residues, while normally having greater than 60 
amino acid residues, usually greater than about 95 amino 
acid residues, more usually greater than about 100 amino 
5 acid residues. Preferably, the V H will be from about 1 10 to 
about 230 amino acid residues in length while V L will be 
from about 95 to about 214 amino acid residues in length. 
V H and V L chains sufficiently long to form Fabs are pre- 
ferred. 

The amino acid residue sequences will vary widely, 
depending upon the particular idiotype involved. Usually, 
there will be at least two cysteines separated by from about 
60 to 75 amino acid residues and joined by a disulfide bond. 
The polypeptides produced by the subject invention will 
normally be substantial copies of ioUotypes of the variable 
regions of the heavy and/or light chains of 
immunoglobulins, but in some situations a polypeptide may 
contain random mutations in amino acid residue sequences 
in order to advantageously improve the desired activity. 

In some situations, it is desirable to provide for covalent 
20 cross linking of the V H and polypeptides, which can be 
accomplished by providing cysteine resides at the carboxyl 
termini. The polypeptide will normally be prepared free of 
the imnuino^obulin constant regions, however a small 
portion of the J region may be included as a result of the 
25 advantageous selection of DNA synthesis rximers. The D 
region will normally be included in the transcript of the V w . 

Typically the C terminus region of the V H and V L 
polypeptides will have a greater variety of sequences than 
the N terminus and. based on the present strategy, can be 
30 further modified to permit a variation of the normally 
occurring V w and V L chains. A synthetic polynucleotide can 
be employed to vary one or more amino acid in a hyper- 
variable region. 

In another embodiment, the invention contemplates a 
heterodimeric receptor molecule comprised of two polypep- 
tide chains, at least one of which is a polypeptide of this 
invention. Preferably, the polypeptide comprises a receptor 
domain derived from an immunoglobulin variable chain. 
^ more preferably a V w . More preferred is a heterodimeric 
receptor comprising receptor domains from both and 
chains. . 

E. Methods for Producing a Library 
1. General Rationale 

45 In one embodiment the present invention provides a 
system for the simultaneous cloning and screening of pre- 
selected ligand-binding specificities from gene repertoires 
using a single vector system. This system provides linkage 
of cloning and screening methodologies and has two 

50 requirements. First, that expression of the polypeptide 
chains of a heterodimeric receptor in an in vitro expression 
host such as E. coti requires coexpression of the two 
polypeptide chains in order that a functional heterodimeric 
receptor can assemble to produce a receptor that binds 

55 ligand. Second, that screening of isolated members of the 
library for a preselected ligand-binding capacity requires a 
means to correlate the binding capacity of an expressed 
receptor molecule with a convenient means to isolate the 
gene that encodes the member from the library. 

60 Linkage of expression and screening is accomplished by 
the combination of targeting of a fusion protein into the 
periplasm of a bacterial cell to allow assembly of a func- 
tional receptor, and the targeting of a fusion protein onto the 
coat of a filamentous phage particle during phage assembly 

65 to allow for convenient screening of the library member of 
interest. Periplasmic targeting is provided by the presence of 
a secretion signal domain in a fusion protein of this inven- 
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tion. Targeting to a phage particle is provided by the 
presence of a filamentous phage coat protein membrane 
anchor domain in a fusion protein of this invention. 

The present invention describes in one embodiment a 
method for producing a library of DNA molecules, each 
DNA molecule comprising a cistron for expressing a fusion 
protein on the surface of a filamentous phage particle. The 
method comprises the steps of (a) forming a ligation admix- 
ture by combining in a ligation buffer (i) a repertoire of 
polypeptide encoding genes and (ii) a plurality of DNA 
expression vectors in linear form adapted to form a fusion 
protein expressing cistron. and (b) subjecting the admixture 
to ligation conditions for a time period sufficient for the 
repertoire of genes to become operatively linked (ligated) to 
the plurality of vectors to form the library. 

In this embodiment, the repertoire of polypeptide encod- 
ing genes are in the form of double-stranded (ds) DNA and 
each member of the repertoire has cohesive termini adapted 
for directional ligation. In addition, the plurality of DNA 
expression vectors are each linear DNA molecules having 
upstream and downstream cohesive termini that are (a) 
adapted for directionally receiving the polypeptide genes in 
a common reading frame, and (b) operatively linked to 
respective upstream and downstream translatable DNA 
sequences. The upstream translatable DNA sequence 
encodes a secretion signal, preferably a pelB secretion 
signal, and the downstream translatable DNA sequence 
encodes a filamentous phage coat protein membrane anchor 
as described herein for a polypeptide of this invention- The 
translatable DNA sequences are also operatively linked to 
respective upstream and downstream DNA expression con- 
trol sequences as defined for a DNA expression vector 
described herein. 

* The library so produced can be utilized for expression and 
screening of the fusion proteins encoded by the resulting 
library of cistrons represented in the library by the expres- 
sion and screening methods described herein. 
2. Production of Gene Repertoires 

A gene repertoire is a collection of different genes, 
preferably polypeptide-encoding genes (polypeptide genes), 
and may be isolated from natural sources or can be generated 
artificially. Preferred gene repertoires are comprised of con- 
served genes. Particularly preferred gene repertoires com- 
prise either or both genes that code for the members of a 
dimeric receptor molecule. 

A gene repertoire useful in practicing the present inven- 
tion contains at least 10 3 . preferably at least 10 4 . more 
preferably at least 10 5 , and most preferably at least 10 7 
different genes. Methods for evaluating the diversity of a 
repertoire of genes is well known to one skilled in the art 

Thus, in one embediment, the present invention contem- 
plates a method of isolating a pair of genes coding for a 
dimeric receptor having a preselected activity from a rep- 
ertoire of conserved genes. Additionally, expressing the 
cloned pair of genes and isolating the resulting expressed 
dimeric receptor protein is also described. Preferably, the 
receptor will be a heterodimeric polypeptide capable of 
binding a ligand. such as an antibody molecule or immu- 
nologically active portion thereof, a cellular receptor, or a 
cellular adhesion protein coded for by one of the members 
of a family of conserved genes, i.e., genes containing a 
conserved nucleotide sequence of at least about 10 nucle- 
otides in length. 

Exemplary conserved gene families encoding different 
polypeptide chains of a dimeric receptor are those coding for 
immunoglobulins, major histocompatibility complex anti- 
gens of class I or U. lymphocyte receptors, integrins and the 
like. 



19-81 7 

18 

A gene can be identified as belonging to a repertoire of 
conserved genes using several methods. For example, an 
isolated gene may be used as a hybridization probe under 
low stringency conditions to detect other members of the 

5 repertoire of conserved genes present in genomic DNA 
using the methods described by Southern. / MoL BioU 
98:503 (1975). If the gene used as a hybridization probe 
hybridizes to multiple restriction endonuclease fragments of 
the genome, that gene is a member of a repertoire of 

10 conserved genes. 
Immunoglobulins 

The immunoglobulins, or antibody molecules, are a large 
family of molecules that include several types of molecules, 
such as IgD. IgG, IgA. IgM and IgE. The antibody molecule 

15 is typically comprised of two heavy (H) and light (L) chains 
with both a variable (V) and constant (Q region present on 
each chain as shown in FIG. 1. Schematic diagrams of 
human IgG heavy chain and human kappa light chain are 
shown in FIGS. 2A and 2B. respectively. Several different 

20 regions of an immunoglobulin contain conserved sequences 
useful for isolating an immunoglobulin repertoire. Extensive 
amino acid and nucleic acid sequence data displaying exem- 
plary conserved sequences is compiled for immunoglobulin 
molecules by Kabat et al., in Sequences of Proteins of 

25 Immunological Interest. National Institutes of Health. 
Bethesda. Md. 1987. 

The C region of the H chain defines the particular immu- 
noglobulin type. Therefore the selection of conserved 
sequences as defined herein from the C region of the H chain 

30 results in the preparation of a repertoire of immunoglobulin 
genes having members of the immunoglobulin type of the 
selected C region. 

The V region of the H or L chain typically comprises four 
framework (FR) regions each containing relatively lower 

35 degrees of variability that includes lengths of conserved 
sequences. The use of conserved sequences from the FR1 
and FR4 (J region) framework regions of the V M chain is a 
preferred exemplary embodiment and is described herein in 
the Examples. Framework regions are typically conserved 

40 across several or all immunoglobulin types and thus con- 
served sequences contained therein are particularly suited 
for preparing repertoires having several immunoglobulin 
types. 

Major Histocompatibility Complex 

45 The major histocompatibility complex (MHC) is a large 
genetic locus that encodes an extensive family of proteins 
that include several classes of molecules referred to as class 
L class n or class IH MHC molecules. Paul et al.. in 
Fundamental Immunology. Raven Press. N.Y.. pp. 303-378 

50 (1984). 

Class I MHC molecules are a polymorphic group of 
transplantation antigens representing a conserved family in 
which the antigen is comprised of a heavy chain and a 
non-MHC encoded light chain. The heavy chain includes 

55 several regions, termed the N. CI, C2. membrane and 
cytoplasmic regions. Conserved sequences useful in the 
present invention are found primarily in the N. CI and C2 
regions and are identified as continuous sequences of 
"invariant residues'* in Kabat et aL. supra. 

60 Class II MHC molecules comprise a conserved family of 
polymorphic antigens that participate in immune respon- 
siveness and are comprised of an alpha and a beta chain. The 
genes coding for the alpha and beta chain each include 
several regions that contain conserved sequences suitable for 

65 producing MHC class II alpha or beta chain repertoires. 
Exemplary conserved nucleotide sequences include those 
coding for amino acid residues 26—30 of the Al region. 
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residues 161-170 of the A2 region and residues 195-206 of 
the membrane region, all of the alpha chain. Conserved 
sequences are also present in the Bl, B2 and membrane 
regions of the beta chain at nucleotide sequences coding for 
amino acid residues 41^5. 150-162 and 200-209. respec- 
tively. 

Lymphocyte Receptors and Cell Surface Antigens 

Lymphocytes contain several families of proteins on their 
cell surfaces including the T-cell receptor. Thy- 1 antigen and 
numerous T-cell surface antigens including the antigens 
defined by the monoclonal antibodies OKT4 (leu3). OKT5/8 
(leu2). OKX3. OKT1 (leul), OKT 11 (leu5) OKT6 and 
OKT9. Paul, supra at pp. 458^*79. 

The T-cell receptor is a term used for a family of antigen 
binding molecules found on the surface of T-cells. The T-cell 
receptor as a family exhibits polymorphic binding specificity 
similar to immunoglobulins in its diversity. The mature 
T-cell receptor is comprised of alpha and beta chains each 
having a variable (V) and constant (Q region. The similari- 
ties that the T-cell receptor has to immunoglobulins in 
genetic organization and function shows that T-cell receptor 
contains regions of conserved sequence. Lai et al.. Nature, 
331:542-546 (1988). 

Exemplary conserved sequences include those coding for 
amino acid residues 84-90 of alpha chain, amino acid 
residues 107-115 of beta chain, and amino acid residues 
91-95 and 111-116 of the gamma chain. Kabat et aL. supra, 
p. 279. 

Integrins And Adhesions 

Adhesive proteins involved in cell attachment are mem- 
bers of a large family of related proteins termed integrins. 
Integrins are heterodimers comprised of a beta and an alpha 
subunit. Members of the integrin family include the cell 
surface glycoproteins platelet receptor GpHb-IIIa. vitronec- 
tin receptor (VnR). fibronectin receptor (FnR) and the leu- 
kocyte adhesion receptors LFA-1. Mac-1. Mo-1 and 60.3. 
Rouslahti et aL, Science, 238:491-497 (1987). Nucleic acid 
and protein sequence data demonstrates regions of con- 
served sequences exist in the members of these families, 
particularly between the beta chain of GpHb-IIIa, VnR and 
FnR. and between the alpha subunit of VnR. Mac-1, LFA-1. 
FnR and Gpnb-IHa. Suzuki et ai., Proc. NatL Acad. Set. 
USA 83:8614-8618, 1986; Ginsberg et al., J. BioL Chem., 
262:5437-5440. 1987. 

Various well known methods can be employed to produce 
a useful gene repertoire. For instance. V w and V L gene 
repertoires can be produced by isolating V^- and V^-coding 
mRNA from a heterogeneous population of antibody pro- 
ducing cells, i.e.. B lymphocytes (B cells), preferably rear- 
ranged B cells such as those found in the circulation or 
spleen of a vertebrate. Rearranged B cells arc those in which 
immunoglobulin gene translocation, i.e., rearrangement, has 
occurred as evidenced by the presence in the cell of mRNA 
with the immunoglobulin gene V, D and J region transcripts 
adjacently located thereon. Typically, the B cells are col- 
lected in a 1-100 ml sample of blood which usually contains 
10* B cells/mL 

In some cases, it is desirable to bias a repertoire for a 
preselected activity, such as by using as a source of nucleic 
acid cells (source cells) from vertebrates in any one of 
various stages of age, health and immune response. For 
example, repeated immunization of a healthy animal prior to 
collecting rearranged B cells results in obtaining a repertoire 
enriched for genetic material producing a receptor of high 
affinity. Mullinax et al.. Proc. NatL Acad. ScL USA, 
87:8095-8099 (1990). Conversely, collecting rearranged B 
cells from a healthy animal whose immune system has not 
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been recently challenged results in producing a repertoire 
that is not biased towards the production of high affinity V^. 
and/or V^ polypeptides. 

It should be noted the greater the genetic heterogeneity of 

5 the population of cells for which the nucleic acids are 
obtained, the greater the diversity of the immunological 
repertoire (comprising V^ and V r coding genes) that will 
be made available for screening according to the method of 
the present invention. Thus, cells from different individuals. 

10 particularly those having an immunologically significant age 
difference, and cells from individuals of different strains, 
races or species can be advantageously combined to increase 
the heterogeneity (diversity) of a repertoire. 
Thus, in one preferred ernbodiment, the source cells are 

15 obtained from a vertebrate, preferably a mammal, which has 
been immunized or partially immunized with an antigenic 
ligand (antigen) against which activity is sought i.e., a 
preselected antigen. The immunizatioq can be carried out 
conventionally. Antibody titer in the animal can be moni- 

20 tored to determine the stage of immunization desired, which 
stage corresponds to the amount of enrichment or biasing of 
the repertoire desired. Partially immunized animals typically 
receive only one immunization and cells are collected from 
those animals shortly after a response is detected. Fully 

25 immunized animals display a peak titer, which is achieved 
with one or more repeated injections of the antigen into the 
host raammai. normally at 2 to 3 week intervals. Usually 
three to five days after the last challenge, the spleen is 
removed and the genetic repertoire of the spleDocytes. about 

30 90% of which are rearranged B cells, is isolated using 
standard procedures. See, Current Protocols in Molecular 
Biology. Ausubel et al.. eds.. John Wiley & Sons, NY. 
Nucleic acids coding for V H and V L polypeptides can be 
derived from cells producing IgA. IgD, IgE. IgG or IgM, 

35 most preferably from IgM and IgG. producing cells. 

Methods for preparing fragments of genomic DNA from 
which immunoglobulin variable region genes can be cloned 
as a diverse population axe well known in the art. See for 
example Herrmann et al.. Methods In EnzymoL, 

40 152:180-183. (1987); Frischauf, Methods In EnzymoL, 
152:183-190 (1987); Frischauf. Methods In EnzymoL, 
152:190-199 (1987); and DiLeila et al.. Methods In 
EnzymoL, 152:199-212 (1987). (The teachings of the ref- 
erences cited herein are hereby incorporated by reference.) 

45 The desired gene repertoire can be isolated from either 
genomic material containing the gene expressing the vari- 
able region or the messenger RNA (mRNA) which repre- 
sents a transcript of the variable region. The difficulty in 
using the genomic DNA from other than non-rearranged B 

so lymphocytes is in juxtaposing the sequences coding for the 
variable region, where the sequences are separated by 
introns. The DNA fragment(s) containing the proper exons 
must be isolated, the introns excised, and the exons then 
spliced in the proper order and in the proper orientation. For 

55 the most part this will be difficult, so that the alternative 
technique employing rearranged B cells will be the method 
of choice because the V. D and J immunoglobulin gene 
regions have translocated to become adjacent, so that the 
sequence is continuous (free of introns) for the entire 

60 variable regions. 

Where mRNA is utilized the cells will be lysed under 
RNase inhibiting conditions. In one embodiment the first 
step is to isolate the total cellular mRNA. Foly A+ mRNA 
can then be selected by hybridization to an oligo-dT cellu- 

65 lose column. The presence of mRNAs coding for the heavy 
and/or light chain polypeptides can then be assayed by 
hybridization with DNA single strands of the appropriate 
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genes. Conveniently, the sequences coding for the constant 
portion of the V w and can be used as polynucleotide 
probes, which sequences can be obtained from available 
sources. See for example. Early and Hood. Genetic 
Engineering. Setlow and Hollaender. eds.. Vol. 3. Plenum 
Publishing Corporation. NY. (1981). pages 157-188; and 
Rabat et aL. Sequences of Immunological Interest. National 
Institutes of Health. Bethesda. Md. (1987). 

In preferred embodiments, the preparation containing the 
total cellular mRNA is first enriched for the presence of V„ 
and/or V L coding mRNA. Enrichment is typically accom- 
plished by subjecting the total mRNA preparation or par- 
tially purified mRNA product thereof to a primer extension 
reaction employing a polynucleotide synthesis primer as 
described herein. Exemplary methods for producing V„ and 
V £ gene repertoires using polynucleotide synthesis primers 
are described in PCT Application No. PCTAJS 90/02836 
(International Publication No. WO 90/14430). Particularly 
preferred methods for producing a gene repertoire rely on 
the use of preselected oligonucleotides as primers in a 
polymerase chain reaction (PCR) to form PCR reaction 
products as described herein. 

In preferred embodiments, isolated B cells are immunized 
in vitro against a preselected antigen. In vitro immunization 
is defined as the clonal expansion of epitope-specifk B cells 
in culture, in response to antigen stimulation. The end result 
is to increase the frequency of antigen-specific B cells in the 
immunoglobulin repertoire, and thereby decrease the num- 
ber of clones in an expression library that must be screened 
to identify a clone expressing an antibody of the desired 
specificity. The advantage of in vitro immucizatioD is that 
human monoclonal antibodies can be generated against a 
limitless number of therapeutically valuable antigens, 
including toxic or weak immunogens. For example, anti- 
bodies specific for the polymorphic determinants of tumor- 
associated antigens, rheumatoid factors, and histocompat- 
ibility antigens can be produced, which can not be elicited 
in immunized animals. In addition, it may be possible to 
generate immune responses which are normally suppressed 
in vivo. 

In vitro immunization can be used to give rise to either a 
primary or secondary immune response. A primary immune 
response, resulting from first time exposure of a B cell to an 
antigen, results in clonal expansion of epitope- specific cells 
and the secretion of IgM antibodies with low to moderate 
apparent affinity constants (lO^lO 8 M"* 1 ). Primary immu- 
nization of human splenic and tonsillar lymphocytes in 
culture can be used to produce monoclonal antibodies 
against a variety of antigens, including cells, peptides, 
macromolecule. haptens, and tumor-associated antigens. 
Memory B cells from immunized donors can also be stimu- 
lated in culture to give rise to a secondary immune response 
characterized by clonal expansion and the production of 
high affinity antibodies (>i(f M~ J ) of the IgG isotype. 
particularly against viral antigens by clonally expanding 
sensitized lymphocytes derived from seropositive individu- 
als. 

In one embodiment peripheral blood lymphocytes are 
depleted of various cytolytic cells that appear to down- 
modulate antigen-specific B cell activation. When 
ly so some -rich subpopulations (natural killer cells* cytotoxic 
and suppressor T cells, monocytes) are first removed by 
treatment with the lysosmotropic methyl ester of leucine, the 
remaining cells (including B cells. T helper cells, accessory 
cells) respond antigen-specificaUy during in vitro immuni- 
zation. The lympholrine requirements for inducing antibody 
production in culture are satisfied by a culture supernatant 
from activated, irradiated T cells. 
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In addition to in vitro immunization, cell panning 
(immunoaffinity absorption) can be used to further increase 
the frequency of antigen-specific B cells. Techniques for 
selecting B cell subpopulations via solid-phase antigen 
5 binding are well established. Panning conditions can be 
optimized to selectively enrich for B cells which bind with 
high affinity to a variety of antigens, including cell surface 
proteins. Panning can be used alone, or in combination with 
in vitro immunization to increase the frequency of antigen- 
ic) specific cells above the levels which can be obtained with 
either technique alone. Immunoglobulin expression libraries 
constructed from enriched populations of B cells are biased 
in favor of antigen-specific antibody clones, and thus, 
enabling identification of clones with the desired specifici- 

15 ties from smaller, less complex libraries. 
3. Preparation of Polynucleotide Primers 

The term "polynucleotide" as used herein in reference to 
primers, probes and nucleic acid fragments or segments to 
be synthesized by primer extension is defined as a molecule 

20 comprised of two or more deoxyribonucleotide or 
ribonucleotides, preferably more than 3. Its exact size will 
depend on many factors, which in turn depends on the 
ultimate conditions of use. 
The term "primer" as used herein refers to a polynucle- 

25 otide whether purified from a nucleic acid restriction digest 
or produced synthetically, which is capable of acting as a 
point of initiation of nucleic acid synthesis when placed 
under conditions in which synthesis of a primer extension 
product which is complementary to a nucleic acid strand is 

30 induced. Le.. in the presence of nucleotides and an agent for 
polymerization such as DNA polymerase, reverse tran- 
scriptase and the like, and at a suitable temperature and pH. 
The primer is preferably single stranded for maximum 
efficiency, but may alternatively be in double stranded form. 

35 If double stranded, the primer is first treated to separate it 
from its complementary strand before being used to prepare 
extension products. Preferably, the primer is a polydeoxyri- 
bonucleotide. The primer must be sufficiently long to prime 
the synthesis of extension products in the presence of the 

40 agents for polymerization. The exact lengths of the primers 
will depend on may factors, including temperature and the 
source of primer. For example, depending on the complexity 
of the target sequence, a polynucleotide primer typically 
contains 15 to 25 or more nucleotides, although it can 

45 contain fewer nucleotides. Short primer molecules generally 
require cooler temperatures to form sufficiently stable hybrid 
complexes with template. 

The primers used herein are selected to be "substantially" 
complementary to the different strands of each specific 

50 sequence to be synthesized or amplified. This means that the 
primer must be sufficiently complementary to no n -randomly 
hybridize with its respective template strand. Therefore, the 
primer sequence may or may not reflect the exact sequence 
of the template. Fox example, a non-complementary nucle- 

55 otide fragment can be attached to the 5* end of the primer, 
with the remainder of the primer sequence being substan- 
tially complementary to the strand. Such oon- 
complementary fragments typically code for an endonu- 
clease restriction site. Alternatively, non-complementary 

60 bases or longer sequences can be interspersed into the 
- primer, provided the primer sequence has sufficient comple- 
mentarity with the sequence of the strand to be synthesized 
or amplified to non-randomly hybridize therewith and 
thereby form an extension product under polynucleotide 

65 synthesizing conditions. 

Primers of the present invention may also contain a 
DNA-dependent RNA polymerase promoter sequence or its 
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complement. See for example. Krieg et al., NucL Acids Res^ 
12:7057-70 (1984); Studier et al.. J. Mol. BioL. 
189: 1 13-130 { 1986); and Molecular Cloning: A Laboratory 
Manual Second Edition, Mania tis et al. eds.. Cold Spring 
Harbor. N.Y. (1989). 

When a primer containing a DNA-dependent RNA poly- 
merase promoter is used the primer is hybridized to the 
polynucleotide strand to be amplified and the second poly- 
nucleotide strand of the DNA-dcpendent RNA polymerase 
promoter is completed using an inducing agent such as E. 
coli DNA polymerase L or the Klenow fragment of £ coli 
DNA polymerase. The starting polynucleotide is amplified 
by alternating between the production of an RNA polynucle- 
otide and DNA polynucleotide- 

Ptimers may also contain a template sequence or repli- 
cation initiation site for a RNA-directed RNA polymerase. 
Typical RNA-directed RNA polymerase include the QB 
replicase described by lizardi et al.. Biotechnology, 
6:1197-1202 (1988) RNA-directed polymerases produce 
large numbers of RNA strands from a small number of 
template RNA strands that contain a template sequence or 
replication initiation site. These polymerases typically give 
a one nullion-fold amplification of the template strand as has 
been described by Kramer et aL. J. Mol Biol^ 89*719-736 
(1974). 

The polynucleotide primers can be prepared using any 
suitable method, such as. for example, the phosphotriester or 
phosphodiestcr methods see Narang et al. Meth. EnzymoL 
68:90, (1979); VS. Pat. No. 4356.270; and Brown et al. 
Meth. EnzymoL 68:109, (1979). 

The choice of a primer's nucleotide sequence depends on 
factors such as the distance on the nucleic acid from the 
region coding for the desired receptor, its hybridization site 
on the nucleic acid relative to any second primer to be used, 
the number of genes in the repertoire it is to hybridize to. and 
the like. 

a. Primers for Producing Immunoglobulin Gene Repertoires 
V w and gene repertoires can be separately prepared 
prior to their utilization in the present invention. Repertoire 
preparation is typically accomplished by primer extension, 
preferably by primer extension in a polymerase chain reac- 
tion (PGR) format 

To produce a repertoire of V^oding DNA homologs by 
primer extension, the nucleotide sequence of a primer is 
selected to hybridize with a plurality of immunoglobulin 
heavy chain genes at a site substantially adjacent to the 
V /r coding region so that a nucleotide sequence coding for 
a functional (capable of binding) polypeptide is obtained. To 
hybridize to a plurality of different V^coding nucleic acid 
strands, the primer must be a substantial complement of a 
nucleotide sequence conserved among the different strands. 
Such sites include nucleotide sequences in the constant 
region, any of the variable region framework regions, pref- 
erably the third framework region, leader region, promoter 
region. J region and the like. 

If the repertoires of V^-coding and V L -coding DNA 
homologs are to be produced by (PGR) amplification, two 
primers. Le., a PCR primer pair, must be used for each 
coding strand of nucleic acid to be amplified. The first 
primer becomes part of the nonsense (minus or 
complementary) strand and hybridizes to a nucleotide 
sequence conserved among V H (plus or coding) strands 
within the repertoire. To produce coding DNA 
homologs. first primers are therefore chosen to hybridize to 
(i.e. be complementary to) conserved regions within the J 
region. CHI region, hinge region. CH2 region, or CH3 
region of immunoglobulin genes and the like. To produce a 
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V L coding DNA homolog. first primers are chosen to hybrid- 
ize with (i.e. be complementary to) a conserved region 
within the J region or constant region of immunoglobulin 
light chain genes and the like. Second primers become part 

5 of the coding (plus) strand and hybridize to a nucleotide 
sequence conserved among minus strands. To produce the 
V^coding DNA homologs. second primers are therefore 
chosen to hybridize with a conserved nucleotide sequence at 
the 5' end of the V^-coding immunoglobulin gene such as in 

10 that area coding for the leader or first framework region. It 
should be noted that in the amplification of both V^- and 
V^-coding DNA homologs the conserved 5' nucleotide 
sequence of the second primer can be complementary to a 
sequence exogenously added using terrninai deoxynucleoti- 

15 dyi transferase as described by Loh et al. Science, 
243:217-220 (1989). One or both of the first and second 
primers can contain a nucleotide sequence defining an 
endonudease recognition site. The site can be heterologous 
to the immunoglobulin gene being amplified and typically 

20 appears at or near the 5' end of the primer. 

When present, the restriction site-defining portion is typi- 
cally located in a 5Merminal non-priming portion of the 
primer. The restriction site defined by the first primer is 
typically chosen to be one recognized by a restriction 

25 enzyme that does not recognize the restriction site defined 
by the second primer, the objective being to be able to 
produce a DNA molecule having cohesive termini that are 
non-complementary to each other and thus allow directional 
insertion into a vector. 

30 In one embodiment the present invention utilizes a set of 
polynucleotides that form primers having a priming region 
located at the 3*-terminus of the primer. The priming region 
is typically the 3'-most (X-terminal) 15 to 30 nucleotide 
bases. The 3*-terminal priming portion of each primer is 

35 capable of acting as a primer to catalyze nucleic acid 
synthesis, i.e., initiate a primer extension reaction off its 3* 
terminus. One or bom of the primers can additionally 
contain a 5 -terminal (5 -most) non-priming portion, i.e.. a 
region that does not participate in hybridization to repertoire 

40 template. 

Id PCR, each primer works in combination with a second 
primer to amplify a target nucleic acid sequence. The choice 
of PCR primer pairs for use in PCR is governed by consid- 
erations as discussed herein for producing gene repertoires. 
45 That is. the primers have a nucleotide sequence that is 
complementary to a sequence conserved in the repertoire. 
Useful \ H and V t priming sequences are shown in Tables 5 
and 6, herein below. 

4. Polymerase Chain Reaction to Produce Gene Repertoires 

50 The strategy used for cloning the V w and V L genes . 
contained within a repertoire will depend, as is well known 
in the art on the type, complexity, and purity of the nucleic 
acids making up the repertoire. Other factors include 
whether or not the genes are contained in one or a plurality 

55 of repertoires and whether or not they are to be amplified 
and/or mutagenized. 

The and V £ -coding gene repertoires are comprised of 
polynucleotide coding strands, such as mRNA and/or the 
sense strand of genomic DNA. If the repertoire is in the form 

60 of double stranded genomic DNA. it is usually, first 
denatured, typically by melting, into single strands. A rep- 
ertoire is subjected to a PCR reaction by treating 
(contacting) the repertoire with a PCR primer pair, each 
member of the pair having a preselected nucleotide 

65 sequence. The PCR primer pair is capable of initiating 
primer extension reactions by hybridizing to nucleotide 
sequences, preferably at least about 10 nucleotides in length 
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and more preferably at least about 20 nucleotides in length, 
conserved within the repertoire. The first primer of a PGR 
primer pair is sometimes referred to herein as the "sense 
primer" because it hybridizes to the coding or sense strand 
of a nucleic acid. In addition, the second primer of a PCR 
primer pair is sometimes referred to herein as the "an tj- sense 
primer" because it hybridizes to a non-coding or anti-sense 
strand of a nucleic acid, i.e.. a strand complementary to a 
coding strand. 

The PCR reaction is performed by mixing the PCR primer 
pair, preferably a predetermined amount thereof, with the 
nucleic acids of the repertoire, preferably a predetermined 
amount thereof, in a PCR buffer to form a PCR reaction 
admixture. The admixture is maintained under polynucle- 
otide synthesizing conditions for a time period, which is 
typically predetermined, sufficient for the formation of a 
PCR reaction product thereby producing a plurality of 
different V^-coding and/or V r -coding DNA homologs. 

A plurality of first primer and/or a plurality of second 
primers can be used in each amplification, e.g.. one species 
of first primer can be paired with a number of different 
second primers to form several different primer pairs. 
Alternatively, an individual pair of first and second primers 
can be used. In any case, the amplification products of 
amplifications using the same or different combinations of 
first and second primers can be combined to increase the 
diversity of the gene library. 

In another strategy, the object is to clone the and/or 
V^-coding genes from a repertoire by providing a polynucle- 
otide complement of the repertoire, such as the anti-sense 
strand of genomic dsDNA or the polynucleotide produced 
by subjecting mRNA to a reverse transcriptase reaction- 
Methods for producing such complements are well known in 
the art 

The PCR reaction is performed using any suitable 
method. Generally it occurs in a buffered aqueous solution. 
Le.. a PCR buffer, preferably at a pH of 7-9, most preferably 
about 8. Preferably, a molar excess (for genomic nucleic 
acid, usually about 10 6 : 1 primer:template) of the primer is 
admixed to the buffer containing the template strand. A large 
molar excess is preferred to improve the efficiency of the 
process. 

The PCR buffer also contains the deoxyribonucleotide 
triphosphates dATP, dCTP. dGTR and dTTP and a 
polymerase, typically thermostable, all in adequate amounts 
for primer extension (polynucleotide synthesis) reaction. 
The resulting solution (PCR admixture) is heated to about 
90° C. -100° C. for about 1 to 10 minutes, preferably from 
1 to 4 minutes. After this heating period the solution is 
allowed to cool to 54° C. which is preferable for primer 
hybridization. The synthesis reaction may occur at from 
room temperature up to a temperature above which the 
polymerase (inducing agent) no longer functions efficiently. 
Thus, for example, if DNA polymerase is used as inducing 
agent, the temperature is generally no greater than about 40° 
C. An exemplary PCR buffer comprises the following: 50 
mM KQ; 10 mM Tris-HCl; pH 8.3; 1.5 mM Mgcl 2 ; 0.001% 
(wt/vol) gelatin, 200 uM dATP; 200 pM dTTP; 200 uM 
dCTP; 200 uM dGTP; and 2.5 units Thermus aquaticus 
DNA polymerase I (U.S. Pat. No. 4.889,818) per 100 
microliters of buffer. 

The inducing agent may be any compound or system 
which will function to accomplish the synthesis of primer 
extension products, including enzymes. Suitable enzymes 
for this purpose include, for example. £ coli DNA poly- 
merase L Klenow fragment of E coli DNA polymerase L T4 
DNA polymerase, other available DNA polymerases. 
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reverse transcriptase, and other enzymes, including heat- 
stable enzymes, which will facilitate combination of the 
nucleotides in the proper manner to form the primer exten- 
sion products which are complementary to each nucleic acid 

5 strand. Generally, the synthesis will be initiated at the 3' end 
of each primer and proceed in the 5' direction along the 
template strand, until synthesis terminates, producing mol- 
ecules of different lengths. There may be inducing agents, 
however, which initiate synthesis at the 5* end and proceed 

to in the above direction, using the same process as described 
above. 

The inducing agent also may be a compound or system 
which will function to accomplish the synthesis of RNA 
primer extension products, including enzymes. In pref erred 

15 embodiments, the inducing agent may be a DNA-dependent 
RNA. polymerase such as T7 RNA polymerase. T3 RNA 
polymerase or SP6 RNA polymerase. These polymerases 
produce a complementary RNA polynucleotide. The high 
turn over rate of the RNA polymerase amplifies the starting 

20 polynucleotide as has been described by Chamberlin et al„ 
The Enzymes. edL R Boyer. PP. 87-108. Academic Press. 
New York (1982). Another advantage of T7 RNA poly- 
merase is that mutations can be introduced into the poly- 
nucleotide synthesis by replacing a portion of cDNA with 

25 one or more mutagenic oligodeoxynucleotides 
(polynucleotides) and transcribing the partially-mismatched 
template directly as has been previously described by Joyce 
et aL. Nuc. Acid Res^ 17:711-722 (1989). Amplification 
systems based on transcription have been described by 

30 Gingeras et aL, in PCR Protocols, A Guide to Methods and 
Applications, pp 245-252. Academic Press. Inc.. San Diego. 
Calif. (1990). 

If the inducing agent is a DNA-dependent RNA poly- 
merase and therefore incorporates ribonucleotide 

35 triphosphates, sufficient amounts of ATP. CTP. GTP and 
UTP are admixed to the primer extension reaction admixture 
and the resulting solution is treated as described above. 

The newly synthesized strand and its complementary 
nucleic acid strand form a double-stranded molecule which 

40 can be used in the succeeding steps of the process. 

The first and/or second PCR reactions discussed above 
can advantageously be used to incorporate into the receptor 
a preselected epitope useful in immunologically detecting 
and/or isolating a receptor. This is accomplished by utilizing 

45 a first and/or second polynucleotide synthesis primer or 
expression vector to incorporate a predetermined amino acid 
residue sequence into the amino acid residue sequence of the 
receptor. 

After producing and V^-coding DNA homologs for a 

50 plurality of different V^- and V^-coding genes within the 
repertoires, the DNA molecules are typically further ampli- 
fied. While the DNA molecules can be amplified by classic 
techniques such as incorporation into an autonomously 
replicating vector, it is preferred to first amplify the moJ- 

55 ecules by subjecting them to a polymerase chain reaction 
(PCR) prior to inserting them into a vector. PCR is typically 
carried out by thermocycling i.e., repeatedly increasing and 
decreasing the temperature of a PCR reaction admixture 
within a temperature range whose lower limit is about 10° C. 

60 to about 40° C. and whose upper limit is about 90° C. to 
about 100° C. The increasing and decreasing can be 
continuous, but is preferably phasic with time periods of 
relative temperature stability at each of temperatures favor- 
ing polynucleotide synthesis, denaturation and hybridiza- 

65 tion. 

PCR amplification methods are described in detail in U.S. 
Pat Nos. 4.683.192. 4.683.202. 4.800.159, and 4.965.188. 
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and at least in several texts including "PCR Technology: 
Principles and Applications for DNA Amplification". H. 
Erlich. ed.. Stockton Press, New York (1989); and "PCR 
Protocols: A Guide to Methods and Applications". Innis et 
al.. eds.. Academic Press. San Diego. Calif. (1990). 

In preferred embodiments only one pair of first and 
second primers is used per amplification reaction. The 
amplification reaction products obtained from a plurality of 
different amplifications, each using a plurality of different 
primer pairs, are then combined. 

However, the present invention also contemplates DNA 
homolog production via co- amplification (using two pairs of 
primers), and multiplex amplification (using up to about 8. 
9 or 10 primer pairs). 

In preferred embodiments, the PCR process is used not 
only to produce a library of DNA molecules, but also to 
induce mutations within the library or to create diversity 
from a single parental clone and thereby provide a library 
having a greater heterogeneity. First, it should be noted that 
the PCR process itself is inherently mutagenic due to a 
variety of factors well known in the art Second, in addition 
to the mutation inducing variations described in the above 
referenced U.S. Pat No. 4.683.195. other mutation inducing 
PCR variations can be employed. For example, the PCR 
reaction admixture, can be formed with different amounts of 
one or more of the nucleotides to be incorporated into the 
extension product. Under such conditions, the PCR reaction 
proceeds to produce nucleotide substitutions within the 
extension product as a result of the scarcity of a particular 
base. Similarly, approximately equal molar amounts of the 
nucleotides can be incorporated into the initial PCR reaction 
admixture in an amount to efficiently perform X number of 
cycles, and then cycling the admixture through a number of 
cycles in excess of X. such as. for instance. 2X. 
Alternatively, mutations can be induced during the PCR 
reaction by incorporating into the reaction admixture nucle- 
otide derivatives such as inosine. not normally found in the 
nucleic acids of the repertoire being amplified. During 
subsequent in vivo amplification, the nucleotide derivative 
will be replaced with a substitute nucleotide thereby induc- 
ing a point mutation. 

5. Linear DNA Expression Vectors 

A DNA expression vector for use in a method of the 
invention for producing a library of DNA molecules is a 
linearized DNA molecule as described before having two 
(upstream and downstream) cohesive termini adapted for 
directional ligation to a polypeptide gene. 

A linear DNA expression vector is typically prepared by 
restriction endonucleasc digestion of a circular DNA expres- 
sion vector of this invention to cut at two preselected 
restriction sites within the sequence of nucleotides of the 
vector adapted for directional ligation to produce a linear 
DNA molecule having the required cohesive termini that are 
adapted for direction ligation. Directional ligation refers to 
the presence of two (a first and second) cohesive tennini on 
a vector, or on the insert DNA molecule to be ligated into the 
vector selected, so that the tennini on a single molecule are 
not complementary. A first terminus of the vector is comple- 
mentary to a first terminus of the insert and the second 
terminus of the vector is complementary to the second 
terminus of the insert 

6. Ligation Reactions to Produce Gene Libraries 

In preparing a library of DNA molecules of this invention, 
a ligation admixture is prepared as described above, and the 
admixture is subjected to ligation conditions for a time 
period sufficient for the admixed repertoire of polypeptide 
genes to ligate (become operatively United) to the plurality 
of DNA expression vectors to form the library. 
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Ligation conditions are conditions selected to favor a 
ligation reaction wherein a phosphodi ester bond is formed 
between adjacent 3* hydroxyl and 5' phosphoryl termini of 
DNA. The ligation reaction is preferably catalyzed by the 

5 enzyme T4 DNA ligase. Ligation conditions can vary in 
time, temperature, concentration of buffers, quantities of 
DNA molecules to be ligated. and amounts of ligase, as is 
well known. Preferred ligation conditions involve maintain- 
ing the ligation admixture at 4 degrees Centigrade (4° C.) to 

10 12° C. for 1 to 24 hours in the presence of 1 to 10 units of 
T4 DNA ligase per milliliter (ml) and about 1 to 2 micro- 
grams (ug) of DNA. Ligation buffer in a ligation admixture 
typically contains 0.5M Tris-HCl (pH 7.4). 0.01M MgCl 2 . 
0.01M dithiothrietol. 1 mM spermidine. 1 mM ATP and 0.1 

15 mg/ml bovine serum albumin (BSA). Other ligation buffers 
can also be used. 

Exemplary ligation reactions are described in Example 2. 
7. Preparation of Dicistronic Gene Libraries 
In a particularly preferred embodiment the present inven- 

2Q tion contemplates methods for the preparation of a library of 
dicistronic DNA molecules. A dicistronic DNA molecule is 
a single DNA molecule having the capacity to express two 
separate polypeptides from two separate cistrons. In pre- 
ferred embodiments, the two cistrons are operatively linked 

22 at relative locations on the DNA molecule such that both 
cistrons are under the transcriptional control of a single 
promoter. Each dicistronic molecule is capable of expressing 
first and second polypeptides from first and second cistrons. 
respectively, that can form, in a suitable host, a het- 

3Q erodimeric receptor on the surface of a filamentous phage 
particle. 

The method for produdng a library of dicistronic DNA 
molecules comprises the steps of: 

(a) Forming a first ligation admixture by combining in a 
35 ligation buffer: 

(i) a repertoire of first polypeptide genes in the form of 
dsDNA. each having cohesive termini adapted for 
directional ligation, and 

(ii) a plurality of DNA expression vectors in linear 
40 form, each having upstream and downstream first 

cohesive tennini that are (a) adapted for directionally 
receiving the first polypeptide genes in a common 
reading frame, and (b) operatively linked to respec- 
tive upstream and downstream translatable DNA 

45 sequences. The upstream translatable DNA sequence 

encodes a pelB secretion signaL the downstream 
translatable DNA sequence encodes a filamentous 
phage coat protein membrane anchor, and translat- 
able DNA sequences are operatively linked to 

50 respective upstream and downstream DNA expres- 

sion control sequences. 

(b) Subjecting the admixture to ligation conditions for a 
time period sufficient to operatively link the first 
polypeptide genes to the vectors and produce a plurality 

55 of circular DNA molecules each having a first cistron 
for expressing the first polypeptide. 

(c) Treating the plurality of circular DNA molecules under 
DNA cleavage conditions to produce a plurality of 
DNA expression vectors in linear form that each have 

60 upstream and downstream second cohesive tennini that 
are (i) adapted for directionally receiving a repertoire of 
second polypeptide genes in a common reading frame, 
and (ii) operatively linked to respective upstream and 
downstream DNA sequences. The upstream DNA 

65 sequence is a translatable sequence encoding a secre- 
tion signaL the downstream DNA sequence has at least 
one stop codon in the reading frame, and the translat- 
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able DNA sequence is operatively linked to a DNA 
expression control sequence. 

(d) Forming a second ligation admixture by combining in 
a ligation buffer: 

(i) the plurality of DNA expression vectors formed in 
step (c). and 

(ii) the repertoire of second polypeptide genes in the 
form of dsDNA. each having cohesive termini 
adapted for directional ligation to the plurality of 
DNA expression vectors; and 

(e) Subjecting the second admixture to ligation conditions 
for a time period sufficient to operatively link the 
second polypeptide genes to said vectors and produce 
a plurality of circular DNA molecules each having the 
second cistron for expressing the second polypeptide, 
thereby forming the library. In preferred embodiments 
a secretion signal is a pelB secretion signal, and the 
membrane anchor is derived from cpVTU as described 
herein. 

DNA expression vectors useful for practicing the above 
method are the dicistronic expression vectors described in 
greater detail before. 

In practicing the method of producing a library of dicis- 
tronic DNA molecules, it is preferred that the upstream and 
downstream first cohesive termini do not have the same 
nucleotide sequences as the upstream and downstream sec- 
ond cohesive termini. In this embodiment, the treating step 
(c) to linearize the circular DNA molecules typically 
involves the use of restriction endonucleases that are specific 
for producing said second termini, but do not cleave the 
circular DNA molecule at the sites that formed the first 
termini. Exemplary and preferred first and second termini 
are the termini defined by cleavage of p€BAK8 with Xho I 
and Spe I to form the upstream and downstream first termini, 
and defined by cleavage of pCBAK8 with Sac I and Xba I 
to form the upstream and downstream second termini. In this 
embodiment, other pairs of cohesive termini can be utilized 
at the respective pairs of first and second termini, so long as 
the four termini are each distinct, non-complementary ter- 
mini. 

Methods of treating the plurality of circular DNA mol- 
ecules under DNA cleavage conditions to form linear DNA 
molecules are generally well known and depend on the 
nucleotide sequence to be cleaved and the mechanism for 
cleavage. Preferred treatments involve admixing the DNA 
molecules with a restriction endonudease specific for a 
endonuclease recognition site at the desired cleavage loca- 
tion in an amount sufficient for the restriction endonuclease 
to cleave the DNA molecule. Buffers, cleavage conditions, 
and substrate concentrations for restriction endonuclease 
cleavage are well known and depend on the particular 
enzyme utilized. Exemplary restriction enzyme cleavage 
conditions are described in Example 2. 

E Phage Libraries 

The present invention contemplates a library of DNA 
molecules that each encode a fusion protein of this invention 
where the library is in the form of a population of different 
filamentous phage particles each containing one of the 
different rDNA molecules. By different ,I>NA molecule is 
meant rDNA molecules differing in nucleotide base 
sequence encoding a polypeptide of this invention. 

Thus, a phage library is a population of filamentous 
phage, preferably fl. fd or M13 filamentous phage, each 
phage having packaged inside the particle a rDNA expres- 
sion vector of this invention. A preferred library is com- 
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prised of phage particles containing DNA molecules that 
encode at least 10 6 . preferably 10 T and more preferably 
10®~* different fusion proteins of this invention. By different 
fusion proteins is meant fusion proteins differing in amino 

5 acid residue sequence. Where the packaged expression vec- 
tor encodes first and second polypeptides of an autogenously 
assembling receptor, e.g. V w and polypeptides that form 
a Fab. the library can also be characterized as containing or 
expressing a multiplicity of receptor specificities. Thus. 

10 preferred libraries express at least 10 3 . preferably at least 10 6 
and more preferably at least 10 7 different receptors, such as 
different antibodies. T cell receptors, in tc grins and the like. 

As described herein, a particular advantage of a filamen- 
tous phage in the present invention is that the DNA molecule 

15 present in the phage particle and encoding one or both of the 
members of the heterodimeric receptor can be segregated 
from other DNA molecules present in the library on the basis 
of the presence of the particular expressed fusion protein the 
surface of the phage particle. 

20 Isolation (segregation) of a DNA molecule encoding one 
or both members of a heterodimeric receptor is conducted by 
segregation of the filamentous phage particle containing the 
gene or genes of interest away from the population of other 
phage particles comprising the library. Segregation of phage 

25 particles involves the physical separation and propagation of 
individual phage particles away from other particles in the 
library. Methods for physical separation of filamentous 
phage particles to produce individual particles, and the 
propagation of the individual particles to form populations 

30 of progeny phage derived from the individual segregated 
particle are well known in the filamentous phage arts. 

A preferred separation method involves the identification 
of the expressed heterodimer on the surface of the phage 

35 particle by means of a ligand binding specificity between the 
phage particle and a preselected ligand. Exemplary and 
preferred is the use of 'Spanning" methods whereby a sus- 
pension of phage particles is contacted with a solid phase 
ligand (antigen) and allowed to specifically bind (or immu- 
noreact where the heterodimer includes an immunoglobulin 
variable domain). After binding, non-bound particles are 
washed off the solid phase, and the bound phage particles are 
those that contain ligand-specific heterodimeric receptor 
(heterodimer) on their surface. The bound particles can then 
be recovered by elution of the bound particle from the solid 

4 phase, typically by the use of aqueous solvents having high 
ionic strength sufficient to disrupt the receptor-ligand bind- 
ing interaction. 

An alternate method for separating a phage particle based 

50 on the ligand specificity of the surface-expressed het- 
erodimer from a population of particles is to precipitate the 
phage particles from the solution phase by crosslinkage with 
the ligand. An exemplary and preferred cross ! inkin g and 
precipitation method is described in detail in Example 4c 

55 The use of the above particle segregation methods pro- 
vides a means for screening a population of filamentous 
phage particles present in a phage library of this invention. 
As applied to a phage library, screening can be utilized to 
enrich the library for one or more particles that express a 

60 hetercKiimcr having a preselected ligand binding specificity. 
Where the library is designed to contain multiple species of 
heterodimers that all have some detectable measure of 
ligand binding activity, but differ in protein structure, 
antigenicity, ligand binding affinity or avidity, and the like, 

65 the screening methods can be utilized sequentially to first 
produce a library enriched for a preselected binding 
specificity, and then to produce a second library further 
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enriched by further screening comprising one or more wild-type cpVTH is transcribed in the host cell than fusion 

isolated phage particles. Methods for measuring ligand proteins, thereby leading to increased ratios of fusion protein 

binding activities, antigenicity and the like interactions compared to cpVTTI during the extrusion process, 

between a ligand and a receptor are generally well known in another embodiment, the amount of heterodiraeric 

and are not discussed further as they are not essential 5 rece pt or 0 n the phage particle surface can be controlled by 

features of the present invention. controlling the timing between expression of fusion proteins 

Thus in one embodiment, a phage library is a population and the superinfection by helper phage. After introduction of 

of particles enriched for a preselected ligand binding sped- the expression vector, longer delay times before the addition 

fici ^ of helper phage will allow for increased accumulation of the 

In 'another embodiment, a phage library comprises a 10 fusion proteins in the host celL 

population of particles wherein each particle contains at EXAMPLES 
least one fusion protein of this invention on the surface of 

the phage particle. The actual amount of fusion protein The foUowing examples are intended to illustrate, but not 

present on the surface of a phage particle depends, in part. ^ ^ thc scopc of ^ invention, 
on the choice of coat protein membrane anchor present in the 

fusion protein. Where the anchor is derived from cpIIL there j Construction of a Dicistronic Expression Vector 

are typically about 1 to 4 fusion proteins per phage particle. for pouring a Heterodimeric Receptor on Phage 

Where the anchor is derived from the more preferred cpVUL Particles 
there is the potential for hundreds of fusion proteins on the 

particle surface depending on the growth conditions and ** To obtain a vector system for generating a large number 
other factors as discussed herein. Preferably, a phage particle of Fab antibody fragments that can be screened directly, 
in a library contains from about 10 to about 500 cpVUI- expression libraries in bacteriophage Lambda have previ- 
derived fusion proteins on thc surface of each particle, and ously been constructed as described in Huse et al.. Science, 
more preferably about 20 to 50 fusion proteins per particle. 246:1275-1281 (1989). These systems did not contain 
Exemplary amounts of surface fusion protein are shown by 25 design features that provide for the expressed Fab to be 
the electron micrographs described in Example 4a that targeted to thc surface of a filamentous phage particle, 
describe particles having about 20 to 24 cpVUI-derived The main criterion used in choosing a vector system was 
fusion proteins per particle. the necessity of generating the largest number of Fab frag- 
In another embodiment, the present invention contem- ^ men ts which could be screened directly- Bacteriophage 
plates a population of phage particles that are the progeny of Lambda was selected as the starting point to develop an 
a single particle, and therefor all express the same het- expression vector for three reasons. First in vitro packaging 
erodimcr on the particle surface. Such a population of phage 0 f phage DNA was the most efficient method of reintroduc- 
are homogeneous and donally derived, and therefore pro- i ng DNA into host cells. Second, it was possible to detect 
vide a source for expressing large quantities of a particular 35 protein expression at the level of single phage plaques, 
fusion protein. An exemplary clonalry homogeneous phage Finally, the screening of phage libraries typically involved 
population is described in Example 4. less difficulty with nonspecific binding. The alternative. 

A filamentous phage particle in a library of this invention plasmid cloning vectors, are only advantageous in the analy- 

is produced by standard filamentous phage particle prepa- sis of clones after they have been identified. This advantage 

ration methods and depends on the presence in a DNA « was not lost in the present system because of the use of a 

expression vector of this invention of a filamentous phage dicistronic expression vector such as pCombVm. thereby 

origin of replication as described herein to provide the r^rmitting a plasmid containing the heavy chain, light cham, 

signals necessary for (1) production of a single-stranded or Fab expressing inserts to be excised, 
filamentous phage replicative form and (2) packaging of the 

repUcativc form into a filamentous phage particle. Such a 45 Construction of ^stromc Expression Vector 

DNA molecule can be packaged when present in a bacterial pCOMB 

cell host upon introduction of genetic complementation to (i) Preparation of Lambda Zap™ II 

provide the filamentous phage proteins required for produc- Lambda Zap™ II is a derivative of the original Lambda 
tion of infectious phage particles. A typical and preferred* Zap (XTCC #40.298) that maintains all of the characteristics 

method for genetic complementation is to infect a bacterial 50 of the original Lambda Zap including 6 unique cloning sites, 

host cell containing a DNA expression Vector of this inven- fusion protein expression, and the ability to rapidly excise 

tion with a helper filamentous phage, thereby providing the the insert in the form of a phagemid (Bluescript SK-), but 

genetic elements required for phage particle assembly. lacks the SAM 100 mutation- allowing growth on many 

Exemplary helper rescue methods are described herein at Non-Sup F strains, including XL 1 -Blue. The Lambda Zap™ 

Example 2. and described by Short et at. Nuc. Acids Res., 55 n was constructed as described in Short et al. M#c. Acids 

16:7583-7600 (1988). Res., 16:7583-7600, 1988. by replacing the Lambda S gene 

The level of heterc>dimeric receptor captured on the contained in a 4254 base pair (bp) DNA fragment produced 

surface of a filamentous phage particle during the process of by digesting Lambda Zap with the restriction enzyme £co L 

phage particle extrusion from the host cell can be controlled This 4254 bp DNA fragment was replaced with the 4254 bp 
by a variety of means. In one embodiment, the levels of 60 DNAfragment containing the lambda S gene isokted from 

£sion F otL^ ^abda *W (*TCC #40.179) after dusting the vector 

in the first and second cistrons for expressing the with the restriction enzyme Nco L The 4254 bp DNA 

polypeptides, such that transcription of the fusion protein fragment isolated from lambda mt0 ^ 
dsTxons occurs at a relative rate greater than the rate of original Larnbda Zap vector using T4I DNA hgase and 

transcription of the cpVUI gene on the helper phage. In 65 standard protocols such as those ^scnbedin Current Pro- 
another embodiment, the helper phage can have an amber tocoh in Molecular Biology. Ausub^ let d..eds John Wiley 
mutation in the gene for expressing cpVTH. such that less and Sons, NY. 1987. to form Lambda Zap™ H 
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(ii) Preparation of Lambda Hc2 solution was maintained at 37° C for 30 minutes and then 

To express a plurality of V^oding DNAhomologs in an theT4 polynucleotide kinase was inactivated by niaintaining 
E coli host cell, a vector designated Lambda Hc2 was the solution at 65° C for 10 minutes. 



TABLE 3 



SEQ. 

n>. no. 



(22) Nl) 5' GCKICarAAATTCXAXrTCAAGGAGACACriCXr 3' 

(23) N2) 5* A-ATGAAAIACCTAlTGCXTTACGCKrAGCCGClGGATT 3' 

(24) N3) 5* OTTATTACTCGCTGCCC AACC AGCC ATCGCCC 3* 

(25) N6) 5* CAGTTTCACCTGGGCCATGGC J UG TTGGG 3* 

(26) N7) 5' CAGCGAGTAAlAACAATCCAGCGGCTGCCCrAGGCAATAG 3* 

(27) N8) 5' GTATTTC ATTATG ACTCnTTTCCTTGAAAlAG AATTT^ 3' 

(28) N9-4) 5' ACKJIT3 AAACTGCTC G AG ATTTCTAG ACTAGTTACCCGTAC 3' 

(29) N10-5) 5' CGGAACGrTCGTACGGGX\ACTAGTCTAGAAATCTCGAG 7 

(30) Nil) 5' GACGTrCCGGACTACGGTICTTAAlAGAATrcG 7 

(31) N12) y TCGACGAAITCTATIAAGAACCaiAGTC 3' 



constructed. The vector provided the following: the capacity 20 The completed synthetic DNA insert was ligated dkectly 

to place the V^oding DNA fromologs in the proper reading into the Lambda Zap™ H vector describe^ m Buimple la(i) 

ro piacc uic v ^ wujg i/iin;wuw b /_ iT. ^ , that had been oreviouslv dicested with the restriction 

frame; a ribosome binding site as described by Shine et aL. had been £ev£u y g was ^ 

Nature, 254:34. 1975; a leader sequence directing the J^"^^" ^ 1 ^ ac S Ke - s instructions tising 

expressed protein to the peripiasrmc space designated the ^ Gigapackn pac king extract available from Stratagene. 

pelB secretion signal; a polynucleotide sequence that coded ^ j oUa ^ Calif Thc panged ligation mixture was plated on 

for a known epitope (epitope tag); and also a polynucleotide XLl-Blue cells (Stratagene). Individual lambda plaques 

that coded for a spacer protein between the V^-coding DNA were ^Tcd and the inserts excised according to the in vivo 

homolog and the polynucleotide coding for the epitope tag. exc i s ion protocol for Lambda Zap™ II provided by the 

Lambda Hc2 has been previously described by Huse et al.. ^ manu f a cturcr (Stratagene). This in vivo excision protocol 

Science, 246:1275-1281 (1989). moved thc cloned insert from thc Lambda Hc2 vector into a 

To prepare Lambda Hc2. a synthetic DNA sequence phagemid vector to allow easy for manipulation and 

containing all of the above features was constructed by sequencing. The accuracy of the above cloning steps was 

designing single stranded polynucleotide segments of 20-40 corjfirmed by sequencing the insert using the Sanger dideoxy 

bases that would hybridize to each other and form the double 3J method described in by Sanger et ah. Proc. Natl. Acad. Sci. 

stranded synthetic DNA sequence shown in FIG. 3. The VSA 74:5463-5467. (1977) and using the manufacture's 

individual single- stranded polynucleotide segments are instructions in the AMV Reverse Transcriptase S-AXP 

shown in Table 3. sequencing kit (Stratagene). The sequence of the resulting 

Polynucleotides N2. N3, N9-4, NIL N10-5, N6, N7 and double- stranded synthetic DNA insert in the V„ expression 

N8 (Table 3) were kinased by adding 1 ul of each poly- 40 vector (Lambda Hc2) is shown in FIG. 3. The sequence of 

nucleotide 0.1 rmcrograms/rnicroliter (ug/pl) and 20 units of each strand (top ^^^l^^^l 1 " f** ™ *5 

*Y polynucleotide kinase to a solution containing 70 mM sequence listing as SEQ. ID. NO. 1 and SEQ. ID. NU. z, 

Tris-HCl. pH 7.6, 10 mM MgCl 2 , 5 mM cUttoothreitol respectively. The resultant Lambda Hc2 expression vector is 

(DTT). 10 mM beta-mercaptoethanoK 500 micrograms per shown in FIG. 4. 

milliliter (ug/ml) bovine serum albumin (BSA). The solution 43 (Hi) Preparation of Lambda Lc2 

was niainiained at 37 degrees Centigrade (37° C) for 30 To express a plurality of V t -coding DNAhomologs m an 

minutes and the reaction stopped by maintaining the solution E. coli host cell, a vector designated Lambda Lc2 was 

at 65° C for 10 minutes. The two end polynucleotides, 20 constructed having the capacity to place the V^-coojng DNA 

ng of polynucleotides Nl and polynucleotides N12. were homologs in the proper reading frame, provided a nbosome 

added to the above kinasing reaction solution together with 50 binding site as described by Shine et al.. Nature, 254:34 

Vio volume of a solution containing 20.0 mM Tris-HCL pH (1975). provided the pelB gene leader sequence secretion 

7.4, 2.0 mM MgQ 2 and 50.0 mM Nad This solution was signal that has been previously used to successfully secrete 

heated to 70° C. for 5 minutes and allowed to cool to room Fab fragments in E. coli by Lei e* al 7. Boc 169:4379 

temperature, approximately 25° C, over 1.5 hours in a 500 ( 1987) and Better et aL. Science, 240: 1041 (1988), and also 

nd beaker of water. During this time period all 10 poly- 55 provided a polynucleotide containing a resection endonu- 

nucleotides annealed to form the double stranded synthetic clease site for cloning. Lambda ^J™^^™** 

DNAmsertshownmnG.3.ThemaiWdualpolynudec<ides described by Huse et al.. Science, 246:1275-1281 (1989). 

were covalentry linked to each other to stabilize the syn- A synthetic DNA sequence containing all of the^above 

thetic DNA insert by adding 40 pi of the above reaction to features was constructed by designing single stranded poly- 

a solution containing 50 mM TYis-HCL pH 7.5. 7 mM 60 ™deotide segments o^ 

MgCL.lmMOTT. linMa&nosme triphosphate (ATP) and each other and form the double stranded synthcuc DNA 

10 units of T4 DNA ligase. This solution was maintained at sequence shown in FIG. 5. The sequence of each mcUvidual 

37° C for 30 minutes and then the T4 DNA ligase was single-stranded polynucleotide segment (01-08) within the 

inactivated by mamtaining the solution at 65* C for 10 double stranded synthetic DNA sequence is shown in Table 
minutes. The end polynucleotides were kinased by mixing 65 4. 

52 ul of the above reaction. 4 ul of a solution containing 10 Polynucleotides 02, 03. 04, 05. 06 and 07 (Table 4) were 

mM ATP and 5 units of T4 polynucleotide kinase. This kinased by adding 1 ul (0.1 ug^ul) of each polynucleotide 
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and 20 units of T 4 polynucleotide kinase to a solution Shine-Dalgamo ribosome binding site as shown in the 

containing 70 mM Tris-HCl. pH 7.6. 10 mM MgCl. 5 mM sequence in FIG. 5 and in SEQ. ID. NO. 3. A Spe I 

DTT. 10 mM bcta-mercaptoethanol. 500 nag/ml of BSA. The restriction site is also present in Lambda Hc2 as shown in 

solution was maintained at 37° C. for 30 minutes and the FIGS. 3 and 4 and in SEQ. ID. NO. 1. A combinatorial 

reaction stopped by maintaining the solution at 65° C for 10 5 vector, designated pComb. was constructed by combining 

minutes. The 20 ng each of the two end polynucleotides. 01 portions of Lambda Hc2 and Lc2 together as described in 

and 08. were added to the above kinasing reaction solution Example la(iv) below. The resultant combinatorial pComb 

together with '/io volume of a solution containing 20.0 mM vector contained two Spe I restriction sites, one provided by 

Tris-HCl. pH 7.4. 2.0 mM MgCI and 15.0 mM sodium Lambda Hc2 and one provided by Lambda Lc2. with an 

chloride (NaCl). This solution was heated to 70° C. for 5 10 EcoR I site in between. Despite the presence of two Spe I 

minutes and allowed to cool to room temperature, approxi- restriction sites, DNA homologs having Spe I and EcoR I 

mately 25° C. over 1.5 hours in a 500 ml beaker of water. cohesive termini were successfully directionally ligated into 

During this time period all 8 polynucleotides annealed to a pComb expression vector previously digested with Spe I 

form the double stranded synthetic DNA insert shown in and EcoR I as described in Example lb below. The prox- 

F1G. 5. The individual polynucleotides were covalently 15 unity of the EcoR I restriction site to the 3* Spe I site, 

linked to each other to stabilize the synthetic DNA insert by provided by the Lc2 vector, inhibited the complete digestion 

adding 40 ul of the above reaction to a solution containing of the 3* Spe I site. Thus, digesting pComb with Spe I and 

50 ml Tris-HCl. pH 7.5. 7 ml MgQ. 1 mm DTT. 1 ram ATP EcoR I did not result in removal of the EcoR I site between 

and 10 units of T4 DNAligase. This solution was maintained the two Spe I sites. 

at 37° C. for 30 minutes and then the T4 DNA iigase was 20 The presence of a second Spe I restriction site may be 

inactivated by maintaining the solution at 65° C for 10 undesirable for ligations into a pComb vector digested only 

minutes. The end polynucleotides were kinased by mixing with Spe I as the region between the two sites would be 

52 ul of the above reaction. 4 ul of a solution containing 10 eliminated. Therefore, a derivative of Lambda Lc2 lacking 

mM ATP and 5 units of T4 polynucleotide kinase. This the second or 3' Spe I site, designated Lambda 1x3. is 

solution was maintained at 37° C for 30 minutes and then 25 produced by first digesting Lambda Lc2 with Spe I to form 

theT4 polynucleotide kinase was inactivated by maintaining a linearized vector. The ends are filled in to form blunt ends 

the solution at 65° C. for 10 minutes. which are ligated together to result in Lambda Lc3 lacking 

TABLE 4 



SEQ. 
ID. NO. 



(32) 01) 5' TGAATTCTAAACTAGTCGCCAAGGAGACAGTCAr 3' 

(33) 02) 5" AATOAAATACXriXrrDCCTACGGCAGCCGCTXKjATT 3' 

(34) 03) y gttatiactccx:tgcccaaccac^x:atggcc 3* 

(35) 04) 5' GAGCTCGrrCAGrrTCTAGAGTTAAGCGGCCG 3' 

p6) 05) 5' GTAlTrCATrATGACIUltlLCl 1 UGCC ACTAGTTTAG AA- 
TTCAAGCT3' 

(37) 06) 5' CAGCGAGTAAlAACAAItXIAGCGGCTGCCGTAGGCAArAG 3' 

(38) 07) y TGACX3AGCTCGGCC ATCGC TGCTTGGG 3' 
P9) 08) y TCGACGGCCOCTTAACTCTAGAAC 3' 



The completed synthetic DNA insert was ligated directly 
into the Lambda Zap™ II vector described in Example 
l(a)(i) that had been previously digested with the restriction 
enzymes Sac I and Xho L The ligation mixture was pack- 
aged according to the manufacture's instructions using 
Gigapack U Gold packing extract (Stratagene). The pack- 
aged ligation mixture was plated on XL1 -Blue cells 
(Stratagene). Individual lambda plaques were cored and the 
inserts excised according to the in vivo excision protocol for 
Lambda Zap™ II provided by the manufacturer 
(Stratagene). This in vivo excision protocol moved the 
doned insert from the Lambda Lc2 vector into a plasmid 
phagemid vector allow for easy manipulation and sequenc- 
ing. The accuracy of the above cloning steps was confirmed 
by sequencing the insert using the manufacture's instruc- 
tions in the AMV Reverse Transcriptase 35 S-dXTP sequenc- 
ing kit (Stratagene). The sequence of the resulting Lc2 
expression vector (Lambda Lc2) is shown in FIG. 5. Each 
strand is separately listed in the Sequence Listing as SEQ. 
ID. NO. 3 and SEQ. ID. NO. 4. The resultant Lc2 vector is 
schematically diagrammed in FIG. 6. 

A preferred vector for use in this invention, designated 
Lambda Lc3. is a derivative of Lambda Lc2 prepared above. 
Lambda Lc2 contains a Spe I restriction site (ACTAGT) 
located 3' to the EcoR I restriction site and 5' to the 



a Spe I site. Lambda Lc3 is a preferred vector for use in 
constructing a combinatorial vector as described below. 

45 (iv) Preparation of pComb 

Phagemids were excised from the expression vectors 
Lambda Hc2 or Lambda Lc2 using an in vivo excision 
protocol described above. Double stranded DNA was pre- 
pared from the phagemid-containing cells according to the 

50 methods described by Holmes et aL. Anal Biochem., 
114:193 (1981). The phagemids resulting from in vivo 
excision contained the same nucleotide sequences for anti- 
body fragment cloning and expression as did the parent 
vectors, and are designated phagemid Hc2 and Lc2. corre- 

55 sponding to Lambda Hc2 and Lc2, respectively. 

For the construction of combinatorial phagemid vector 
pComb. produced by combining portions of phagemid Hc2 
and phagemid Lc2, phagemid Hc2 was first digested with 
Sac I to remove the restriction site located 5' to the LacZ 
promoter. The linearized phagemid was then blunt ended 

60 with T4 polymerase and ligated to result in a Hc2 phagemid 
lacking a Sac I site. The modified Hc2 phagemid and the Lc2 
phagemid were then separately restriction digested with Sea 
I and EcoR I and the linearized phagemids were ligated 
together at their respective cohesive ends. The ligated 

65 phagemid vector was then inserted into an appropriate 
bacterial host and transforraants were selected on the anti- 
biotic ampicillin. 
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Selected ampicillin resistant transformants were screened 
for the presence of two Not I sites. The resulting ampicillin 
resistant combinatorial phageraid vector was designated 
pComb, the schematic organization of which is shown in 
FIG. 7. The resultant combinatorial vector. pComb. con- 
sisted of a DNA molecule having two cassettes to express 
two fusion proteins and having nucleotide residue sequences 
for the following opcratively linked elements listed in a 5* to 
3' direction: a first cassette consisting of an inducible LacZ 
promoter upstream from the LacZ gene; a Not I restriction 
site; a ribosome binding site; a pelB leader; a spacer; a 
cloning region bordered by a 5' Xho and 3* Spe I restriction 
site; a decapeptide tag followed by expression control stop 
sequences; an EcoR I restriction site located 5* to a second 
cassette consisting of an expression control ribosome bind- 
ing site; a pelB leaden a spacer region; a cloning region 
bordered by a 5' Sac I and a 3* Xba I restriction site followed 
by expression control stop sequences and a second Not I 
restriction site. 

A preferred combinatorial vector for use in this invention, 
designated pComb2, is constructed by combining portions 
of phagemid Hc2 and phagemid Lc3 as described above for 
preparing pComb. The resultant combinatorial vector, 
pComb2, consists of a DNA molecule having two cassettes 
identical to pComb to express two fusion proteins identically 
to pComb except that a second Spe I restriction site in the 
second cassette is eliminated, 

b. Construction of Vectors pCombVUI and 
pCombm for Expressing Fusion Proteins Having a 
Bacteriophage Coat Protein Membrane Anchor 

Because of the multiple endonuclease restriction cloning 
sites, the pComb phagemid expression vector prepared 
above is a useful cloning vehicle for modification for the 
preparation of an expression vector of this invention. To that 
end. pComb is digested with EcoR I and Spe I followed by 
phosphatase treatment to produce linearized pComb. 

(i) Preparation of pCombVEDt 

A PCR product produced in Example 2g and having a 
nucleotide sequence that defines a filamentous bacterioph- 
age coat protein VTH (cpVUI) membrane anchor domain and 
cohesive Spe I and EcoR I termini was admixed with the 
linearized pComb to form a ligation admixture. The cpVIH- 
membrane anchor-encoding PCR fragment was direction- 
ally ligated into the pComb phagemid expression vector at 
corresponding cohesive termini, that resulted in forming 
pCombVUI (also designated pComb8). pCombVUI contains 
a cassette defined by the nucleotide sequence shown in SEQ. 
ID. NO. 116 from nucleotide base 1 to base 208. and 
contains a pelB secretion signal operatively linked to the 
cpVUI membrane anchor. 

A preferred phagemid expression vector for use in this 
invention, designated either pComb2-VHl or pCorab2-8, is 
prepared as described above by directionally ligating the 
cpVTH membrane anchor-encoding PCR fragment into a 
pComb2 phagemid expression vector via Spe I and EcoR I 
cohesive terminii. The p€omb2-8 has only one Spe I restric- 
tion site, 

(ii) Preparation of pCombm 

A separate phagemid expression vector was constructed 
using sequences encoding bacteriophage cpHI membrane 
anchor domain. A PCR product defining the cpHI membrane 
anchor and Spe I and EcoR I cohesive termini was prepared 
as described for cpVTH. the details of which are described in 
Example 2g. The cpHI-derived PCR product was then 
ligated into linearized pComb vector to form the vector 
pCombm (also designated pComb3). 
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A preferred phagemid expression vector for use in this 
invention, designated either pComb2-III or pComb2-3. is 
prepared as described above by directionally ligating the 
cpHI membrane anchor-encoding PCR fragment into a- 
5 pComb2 phagemid expression vector via Spe I and Spe I 
cohesive terminii. The pComb2-3 has only one Spe I restric- 
tion site. 

c. Construction of pCBAK Vectors Having a 
Chloramphenicol Resistance Marker 

In order to utilize a different selectable marker gene, such 
as chloramphenicol acetyl transferase (CAT), for the selec- 
tion of bacteria transformed with a vector of this invention, 
expression vectors based on pComb were developed having 
15 a gene encoding CAT and are designated pCBAK vectors. 
The pCBAK vectors are prepared by combining portions of 
pCB and pComb. 

(i) Preparation of pCB 

pBlueScript phagemid vectors. pBC SK(-) and pBS SK(- 

20 ). (Stratagene), were modified and combined to generate a 
third vector designated pCB as described below. 

pBC SK(-). which contains a chloramphenicol resistance 
selectable marker gene, was digested with Bst BI and blunt 
ended with T4 polymerase. A second digestion with Pvu I 

25 allowed for the removal of a 1 kilobase (kb) fragment 
leaving a 2.4 kb linearized vector which retained the CAT 
selectable resistance marker gene, an inducible LacZ pro- 
moter upstream from the LacZ gene and a ColEl origin 
region. The 2.4 kb fragment was recovered. The pBS SK(-> 

30 vector was digested with Aat H and blunt ended with T4 
polymerase. A second digestion with Pvu I allowed for the 
isolation of an 800 base pair (bp) fragment containing the f 1 
origin of replication. Ligation of the pBS derived 800 bp f 1 
fragment with the 2.4 kb pBC fragment created a pCB 

35 precursor vector containing a Sac I site, an fl origin of 
replication, a CAT selectable resistance marker gene. ColEl 
origin, a multiple cloning site (MCS) flanked by T 3 and T 7 
promoters, and an inducible LacZ promoter upstream from 
LacZ gene. 

40 The pCB precursor vector was then digested with Sac I 
and blunt-ended with T4 polymerase. The T4 polymerase - 
treated pCB vector was then religated to form pCB vector 
and is lacking a Sac I site. 

(ii) Preparation of pCBAKO 

45 The pCB vector containing the CAT selectable resistance 
marker gene was digested with Sac II and Apa I and treated 
with phosphatase to prevent rcligation and to form linearized 
pCB vector. The pComb vector prepared in Example l(aXiv) 
was restriction digested with Sac II and Apa I to release a 

so fragment containing nucleotide residue sequences starting 5* 
to the LacZ promoter and extending past the 3* end of the 
second Not I site. The Sac II and Apa I pComb DNA 
fragment was then directionally ligated into the similarly 
digested pCB vector to form phagemid expression vector 

55 pCBAKO. Preferred pCBAK expression vectors are con- 
structed with pCorab2. The resultant pCBAK expression 
vector contains only one Spe I restriction site. 

(iii) Preparation of pCBAK8 

To prepare a pCBAK-based phagemid expression vector 
60 which encodes a bacteriophage coat protein membrane 
anchor domain in the expressed fusion protein. pCB 
phagemid cloning vector prepared in Example lc(ii) was 
linearized by digestion with Sac II and Apa L The pCom- 
bVUI phagemid expression vector, prepared in Example 
65 lb(i), was restriction digested with Sac II and Apa I to form 
a fragment containing a nucleotide residue sequence starting 
5' to the LacZ promoter and extending past the J end of the 
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second Not 1 site. The fragment was directional] y ligated 9. If cpIH is used, the accumulation occurs on the tail of the 
into the linearized pCB cloning vector to form phagemid bacteriophage. The advantage of the utilization of mem- 
expression vector pCBAK8. brane anchors from cpVUI over cpHI is two fold. Firstly, a 
(iv) Preparation of pCBAK3 multiplicity of binding sites, consisting of approximately 
The phagemid expression vector. pCBAK3. for the 5 2700 cpVTH monomers assembled in a tubular array, exist 
expression of fusion protein having cpm membrane anchor along the particle surface. Secondly, the construct does not 
domains, was similarly constructed by direcu'onalry ligating interfere with phage infectivity. 
the Sac II and Apa I restriction digested fragment from a. Polynucleotide Selection 

pCombm with Sac II and Apa I linearized pCB cloning The nucleotide sequences encoding the immunoglobulin 

vector. 10 protein CDR's are highly variable. However, there are 

several regions of conserved sequences that flank the V 

2. Construction of Dicistronic Expression Vectors region domains of either the light or heavy chain, for 

for Expressing Anti-NPN Heterodimer on Phage instance, and that contain substantially conserved nucleotide 

Surfaces sequences, i.e.. sequences that will hybridize to the same 

^^^^^^T^^^lZ a»d incorporate restriction sites into the DNA 
arefirst^getedtothepenplasmof £ ~/. h oLlog produced that are suitable for operatively linking 

of heterodimenc Fab mfccufcs. In order to obtarn expres, Resized DNA fragments to a vector were con- 

sion of anu-body Fab nbranes on » M struct L More specfficaUy^he primers are designed so that 

nucleoude residue sequences , encoding ether th : Fd or hght DNAbomologs produced can be inserted into 
chains must be operatively Ijnk « to an expre$si * n veetor of this invention in reading frame with 

sequence enco^g a filamentous b^Kptage ™* P~te» ^ P ttMslatable DNA DCe at the region of ^ 

membrane _anchor .Twopreferred coat proteins foruse , ur dus vector containing the directional ligation means, 

invention in providing a membrane anchor are VIII and HI ^. v ° 

(cpVin and cpm. respectively) m u^e Exarnples described » W ' * *^ cation of ^ V „ domains, primers are 

^™^ fOT0 ^ designed^ introduce cohesive^errnini compatible with 

sequence encoding a Fd chain to either cpVHI or cpffi ^ ^ ^ u fi ^ ft ^ ^ T and s l sitcs of 

membrane anchors in a fusion protein of this invention are ^ ^ q vcctor For cxam p le , the 3' 

described. M primcr (prin^ J2A in Table 5). was designed to be comple- 

In a phagemid vector, a first and second cistron consisting racDt ary t0 me mRNA in the J w region. In all cases, the 5' 

of translatable DNA sequences are operatively linked to prin^ (primers 1-10. Table 5) were chosen to be complc- 

form a dicistronic DNA molecule. Each cistron in the mcnt ary to the first strand cDNA in the conserved 

dicistronic DNA molecule is linked to DNA expression N . t erminus region (antisense strand). Initially amplification 

control sequences for me coordinate expression of a fusion 35 was petfood with a mixture of 32 primers (primer 1. Table 

protein. Fd-cpVHI or Fd-cpin. and a kappa light chain. 5) ma| were ^ geJla3 i c at five positions. Hybridoma mRNA 

The first cistron encodes a periplasmic secretion signal could be amplified with mixed primers, but initial attempts 

(pelB leader) operatively linked to the fusion protein, either lo amplify mRNA from spleen yielded variable results. 

Fd-cpVm or Fd-cpEL The second cistron encodes a second Therefore, several alternatives to amplification using the 

pelB leader operatively linked to a kappa light chain. The ^ mixed 5' primers were compared. 

presence of the pelB leader facilitates the coordinated but alternative was to construct multiple unique 

separate secretion of both the fusion protein and light chain r^imers. eight of which are shown in Table 5, corresponding 

from the bacterial cytoplasm into the periplasmic space, to ^dividual members of the mixed primcr pool. The 

The process described above is schematically dia- individual primers 2-9 of Table 5 were constructed by 

grammed in FIG. 8. Briefly, the phagemid expression vector 45 incorporating either of the two possible nucleotides at three 

carries a chloramphenicol acetyl transferase (CAT) select- c f the five degenerate positions. 

able resistance marker gene in addition to the Fd-cpVIH The second alternative was to construct a primer contain- 

fusion and the kappa chain. The fl phage origin of replica- j n g inosine (primer 10. Table 5) at four of the variable 

tion facilitates the generation of single stranded phagemid. positions based on the published work of Takahashi. et al.. 

The isopropyl thiogalactopyranos.de (IPTG) induced 50 Proc. Natl Acad. ScL (USA.), 82:1931-1935. (1985) and 

expression of a dicistronic message encoding the Fd-cpVm Ohtsuka et al.. J. Biol Chenu, 260:2605-2608. (1985). This 

fusion (V„, C W1 , cpVTH) and the light chain (V t , CJ leads primer has the advantage that it is not degenerate and, at the 

to the formation of heavy and light chains. Each chain is . same time miiiimizes the negative effects of mismatches at 

delivered to the periplasmic space by the pelB leader the unconsented positions as discussed by Martin etaL.Mic. 

sequence, wruch is subsequently cleaved. The heavy chain is 55 Acids Res., 13:8927 (1985). However, it was not known if 

anchored in the membrane by the cpVTH membrane anchor the presence of inosine nucleotides would result in incor- 

domain while the light chain is secreted into the periplasm. poration of unwanted sequences in the cloned V H regions. 

The heavy chain in the presence of light chain assembles to Therefore, inosine was not included at the one position that 

form Fab molecules. This same result can be achieved if. in remains in the amplified fragments after the cleavage of the 

the alternative, the light chain is anchored in the membrane ^ restriction sites. As a result, inosine was not in the cloned 

via a light chain fusion protein having a membrane anchor insert. 

and heavy chain is secreted via a pelB leader into the Additional V w amplification primers including the unique 

periplasm. 3' primer were designed to be complementary to a portion of 

With subsequent infection of £. colt with a helper phage, the first constant region domain of the gamma 1 heavy chain 

as the assembly of the filamentous bacteriophage progresses. 65 mRNA (primers 16 and 17, Table 5). These primers will 

the coat protein VHI is incorporated along the entire length produce DNA homologs containing polynucleotides coding 

of the filamentous phage particles as shown in FIGS. 8 and for amino acids from the V„ and the first constant region 
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domains of the heavy chain. These DNA homologs can 
therefore be used to produce Fab fragments rather than an 

Additional unique 3' primers designed to hybridize to 
similar regions of another class of immunoglobulin heavy 
chain such as IgM. IgE and IgA are contemplated Other 3' 
primers that hybridize to a specific region of a specific class 
of constant region and are adapted for transferring the 
V H domains amplified using this primer to an expression 
vector capable of expressing those domains with a 
different class of heavy or light chain constant region are 
also contemplated. 

As a control for amplification from spleen or hybridoma 
mRNA. a set of primers hybridizing to a highly conserved l5 
region within the constant region IgG. heavy chain gene 
were constructed The 5' primer (primer 11. Table 5) is 
complementary to the cDNA in the C H 2 region whereas the 
3' primer (primer 13, Table 5) is complementary to the 
mRNA in the C^3 region. It is believed that no mismatches M 
were present between these primers and their templates. 

The primers used for amplification of heavy chain Fd 
fragments for construction of Fabs are shown at least in 
Table 5. Amplification was performed in eight separate 
reactions, each containing one of the 5' primers (primers 25 
2-9) and one of the 3' primers (primer 16). The remaining 
5 primers that have been used for amplification in a single 
reaction are either a degenerate primer (primer 1 ) or a primer 
that incorporates inosine at four degenerate positions 
(primer 10, Table 5. and primers 17 and 18, Table 6). The 30 
remaining 3* primer (primer 14. Table 6) has been used to 
construct F v fragments. Many of the 5' primers incorporate 
a Xho I site, and the 3' primers incorporate a Spe I restriction 
site for insertion of the V H DNA homolog into the phagemid 
Hc2 expression vector (FIG. 4). 35 

V H amplification primers designed to amplify human 
heavy chain variable regions are shown in Table 6. One of 
the 5* heavy chain primer contains inosine residues at 
degenerate nucleotide positions allowing a single primer to 
hybridize to a large number of variable region sequences. 40 
Primers designed to hybridize to the constant region 
sequences of various IgG mRNAs are also shown in Table 
6. 



(ii) V L Primers 

The nucleotide sequences encoding the V L CDRs are 
highly variable. However, there are several regions of coo- 
served sequences that flank the CDR domains including 
the J L . Yl framework regions and V L leader/promotor. 
Therefore, amplification primers were constructed that 
hybridized to the conserved sequences and incorporate 
restriction sites that allow cloning the amplified fragments 
into the phagemid Lc2 vector cut with Sac I and Xba L 

For amplification of the V L CDR domains, the 5* primers 
(primers 1-8 in Table 6) were designed to be complementary 
to the first strand cDNA in the conserved N-terminus region. 
These primers also introduced a Sac I restriction endonu- 
clease site to allow the V L DNA homolog to be cloned into 
the phagemid Lc2 expression vector. The 3* V L amplification 
primer (primer 9 in Table 6) was designed to be comple- 
mentary to the mRNA in the 1 L regions and to introduce the 
Xba 1 restriction endonuclease site required to insert the 
DNA homolog into the phagemid Lc2 expression vector 
(FIG. 6). 

Additional 3* amplification primers were designed to 
hybridize to the constant region of either kappa or lambda 
mRNA (primers 10 and 11 in Table 6). These primers allow 
a DNA homolog to be produced containing polynucleotide 
sequences coding for constant region amino acids of either 
kappa or lambda chain. These primers make it possible to 
produce an Fab fragment rather than an Fy, 

The primers used for amplification of kappa light chain 
sequences for construction of Fabs are shown at least in 
Table 6. Amplification with these primers was performed in 
5 separate reactions, each containing one of the 5' primers 
(primers 3-6. and 12) and one of the 3' primers (primer 13). 
The remaining 3 r primer (primer 9) has been used to 
construct F v fragments. The 5* primers contain a Sac I 
restriction site and the 3* primers contain a Xba I restriction 
site. 

amplification primers designed to amplify human light 
chain variable regions of both the lambda and kappa iso- 
types are also shown in Table 6. 

All primers and synthetic polynucleotides described 
herein, including those shown in Tables 3-7 were either 
purchased from Research Genetics in Huntsville. Ala. or 
synthesized on an Applied Biosystems DNA synthesizer, 
model 381 A, using the manufacturer's instruction. 



TABLE 5 



(1) yAGGTT(C/GXC/A)A(G/AXn , (G/T)CTCGAGTC(T/A)GG 3* 



(2) 5 ' AOGTC CAGCTGCTC G AGTCTGG 3* 

(3) ^ AGGTC CAGCTGCTC G AGTC AGO 3' 

(4) 5' AGGTC CAGCTTCTCG AGTC TGG 3* 

(5) yAGGTCCAGCTTCTCGACriCAGG 3* 

(6) yAGGTCCAAC 1 GC J LGAGTCTGG 3* 

(7) yAGOTCCAACTCCTCGAGTCAGG 3' 

(8) 5" ACXJITX AACTTCTCG AG 1 C IGG 3" 

(9) yAGGTCCAACTrCTCGAGTCAGG 3" 

(10) yAC<3TNNANCTNCrcGAGTC(T/A)GG 3* 

(11) StXXCAACKSATGTGCTCACC 3* 



degenerate 3' primer for the amplification 
of mouse and h*www heavy chain variable 
regions (Vrf 

Unique S piim er for the amplification 
of mouse and human V H 
Unique 5* primer for the amplification 
of mouse and lT* trnfln 
Unique 5* pr mwt for the amplification 
of mouse and human V K 
Unique $ prima for the amplification 
of mouse and human V H 
Unique 5" primer for the amplification 
of mouse and human V K 
: Unique 5* primer for the amplification 
of mouse and human V H 
Unique S primer for the amplification 
of mouse and fr**""* ^ V jj 
Unique 5* primer for the amplification 
of mouse arK i hp*™*** 1 V H 
5* degenerate primer containing inosme at 

4 degenerate positions for amplification of 
mouse 

5 primer far amplification in the C M 2 
region of mouse lgGl 
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TABLE 5-continued 



(12) 5 CTAITAGAAITCAACGGTAACAGTGGTT3CCTTOGCCCCA 3" 
(12A) 5 , CTATTAACTAGTAACCXjrAACAGTCX3TGCCTrc CCCCA 3' 

(13) 5 CTC AGTATGGTCGTTGTGC 3* 

(14) 5*GCrACTAGTTTTGArTTCCACCTTGG 3* 

(15) S'CAGCCATGGCCGACATCCAGATG 3* 

(16) 5'AATTTTACTAGTCACCriGGTGCIXXrTGGC 3' 

(17) 5X\TGCAAC1AGTACAACCACAATCCCTGGGCACAATTTT 3' 

(18) 5* AGGCTTACTAGTAC AATCCCTGGGC AC AAT 3* 



3* primer for amplification of V H and 
■ introducing a 3* Eco RI site 
3* primer for amplification of V H using 3' 
Spe 1 site 

3' primer for amplification in tbc C H 3 

region of mouse IgGl 

3' primer for amplification of mouse kappa 

light chain variable regions (V,_) 

5* primer for amplification of mouse kappa 

light chain variable regions 

Unique 3' primer for amplification of V H 

including part of the mouse gamma 1 first 

constant region 

Unique 3' primer for amplification of Fd 
including part of mouse IgGl first constant 
region and binge region 
3* primer for amplifying mouse Fd including 
part of the mouse IgG first constant region 
and part of the hinge region 



TABLE 6 



(i) 

(2) 
(3) 
(«> 
(5) 
(6) 
(7) 
(8) 
(9) 



S CCAGTTCCG AGC 1 LXJ 1 1 UJX3ACTCAGGAATCT 3* 
3' CCAGTTtXGAGCTCGTGTTGACGCAGCOGCCC 3* 
5" CCAGTTCCGAGCTCXTrGCrCACCCAGTCTCCA T 
5" CCAGTrCCOAGCrcCAGATCACCCAGTCTCCA 3* 
5* CCAGATGTGAGCTCGTGATGACCCAGACTCC A 7 
5 CCAGAItnGAGCICGTCATGACCCAGTXTrCCA 3* 
S CCAGATGTGAGCTCTITjATGACCCAAACTCAA 3* 
5* CCAGATGrrGAGCTCGTCAlAACCCAGGATGAA 3* 
S GCAGCATTCTAGAGTTTCAGCTCCAGCTTGCC 3" 



( 10) 5' CCGCCGTCTAGAACACTC AJ 'I UC1 U 1 1 GAAGCT 3* 

(11) 5* CCGCCGTCTAGAACATICTGC AGGAGACAGACT 3* 

(12) 5* CCAGTITCCGAGCTCOTGaTOACACAGTCICCA 3' 

(13) S GCGCCGTICIAGAATTAACACTC Ai ' 1 UL lOi 1UAA 3* 

(14) 5* CTATTAACIAGTAACGGTAACAGT^ 

-(15) S AGGCTTACTAGTACAATfXCTGGGC AC AAT 3* 

(1 6) 5* GCCGCTtnAGAACACTCATTCCITjrTGAA 3* 

(17) 5" AGGTnAICnCTCGAGTCTGC 3* 

(18) 5' ACXHTIAJCTICTCGAGTCAGC 3* 



(19) S GTGCCAGATGTGAOCTCGTGATGACCX: AO I^TCCA 3* 

(20) 5 TCCTTCTAG ATTACTAACACTCTCCC 7 

(2 1) 5 GCATTCTAGACTATTA1T3AACATTCTGTAGGGGC 3* 

(22) S CTGCACAGGGTCCTCKKXrCGAGCTtXTKXnn 3* 

(23) 5" AGTTCCAIITGCTCGAGTXrTGG 3* 



(24) y GTGOGC ATCTTCiTGAG" 1" 1 U 1U J UACTAGTTCGCXJ 11 1' 1GAGCTC 3* 

(25) 5' AGCaTCACTAGTACAAGATTTGGGCTC 3* 



Unique 5' primer for the amplication 
of kappa light chain variable regions 
Unique 5 1 primer for the amplication 
of kappa light chain variable regions 
Unique 5* primer for the amplication 
of kappa light chain variable regions 
Unique S primer for the amplication 
of kappa light chain variable regions 
Unique S primer for the amplication 
of kappa light chain variable regions 
Unique S primer for the amplication 
of kappa light chain variable regions 
Unique 5* primer for the amplication 
of kappa light chain variable regions 
Unique 5* primer for the amplication 
of kappa light chain variable regions 
Unique 3' primer for amplification of 
kappa light chain variable regions 
Unique 3" primer for mouse kappa light 
chain amplification including the 
constant region 

Unique 3' primer for mouse lambda light 
chain amplification including the 
constant region 
Unique y primer for V L 
amplification 

Unique 3' primer for amplification of 
kappa light chain 

Unique 3* primer for amplification of 
mouse F v 

Unique 3* primer for amplification 

of mouse IgG Fd 

Unique 3* primer for amplification 

of mouse kappa light chain 

Degenerate 5" primer containing 

inosine at 4 degenerate positions for 

amplifying mouse Vjj 

Degenerate 5* primer containi n g 

inosine at 4 degenerate positions for 

amplifying mouse V H 

Unique y primer for human and mouse 

kappa V L amplification 

Unique 3* primer for kappa 

V L amplification 

Unique 3' primer for human, mouse and 
rabbit lambda V L amplification 
Unique y primer for human lambda 
V L amplification 

5' degenerate pr im er for human V H 
amplification coot Billing t nosi n t 
at 3 degenerate positions 
Unique 3* primer for human V H 
amplification 

Unique 3* primer for human IgGl Fd 
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TABLE 6-continued 









(26) 


*• * nr aty' a r*TA<"TTA C AAGATTTGGGC1XT 3" 


Unique 3' primer for amplification 




of human variable regions 


<*) 


j AGO 1 UC ACK- ITj*- IL-OALr l v^ovjvj j 


Unique 3* primer for amplification 




nf Tiitm4H v "iff o V%1^ T^ffiAfl^ n/.*\ 
vJl juuiau TM jAi/tc reywuj \ t j|/ 


(28) 


5* AGGTTCC AACTGC TC G AGTCTGG 3 


Unique 3* primer for amplification 




z*\ f Kiimin vwvahl^ f^o^n< A/„l 
ul UUUiaii " «4 1*11/ 1 w ic£jmia \ ▼ 


(29) 


5" AGGTGCAACTtSCTCGAGTCGCfO 3 






of human variable regjGiis (Vj|) 


(30) 


5* lULri ll^lAGATTACTAAwVC lv_ ILa-*^- ikj I iuaa j 


3* primer is buman kappa light chain 




constant region 


00 


5* CTGCACAGGGTCCTGGGCCGAGCTCGTGGTGACTCAG 3 


5* pnmcr for mnplmxatioD of human 




lambda Li gill ftv*»n variable regions 


(32) 


5* GCATTCIAGACTATTAACAl iuIuIAGGCGC 3' 


3* primer in buman lambda light chain 




constant rc£ joq 


(33) 


5' ACCC AAGG AC AC CCTC ATG 3* 


Control primer hybridizing to the 




human CH 2 region 


(34) 


5* CTCAGTATGU 1 GUI rGTGC 3* 


Control primer hybridizing to the 




human CH 3 region 


(35) 


5' GTCTCACTAGnrrCCACCAAGGGCCCAICGGTC 3' 


5' primer for amplifying human IgG 




heavy chain first constant region 


(36) 


y ATATACTAGTGAGACAGTGACCAGGGTTCCriXX3CCrc 3* 


3' primer for amplifying human heavy 




chain variable regions 


(37) 


y ACGTCTAGATICCACC lTGGTCCC 3* 


3' primer for amplifying buman fcapp 




chain variable regions 


(38) 


y GCA3ACXAGTCTATIAACAJTCTOXAGGGGC 3* 


5' primer for amplifying human kappa 




light chain constant region 


(39) 


y CCGGAA1TCTTATCATTIACCCGGAGA 3* 


3' primer located in the CH3 region of 




buman IgG J to amplify tbe entire heavy 






chain 


(40) 


y TCTCCACTAGTKX3AATGGGCACATGCAG 3* 


3* primer for amplifying the Fd region 




of mouse IgM 



The 19 primers listed in Table 5 have been listed in the 
Sequence Listing and have been assigned the following 
SEQ. ID. NO.: 

(1)=SEQ. ID. NO. 40 
(2>=SEQ. ID. NO. 41 

(3) =SEQ. ID. NO. 42 

(4) =SEQ. ID. NO. 43 

(5) =SEQ. ID. NO. 44 

(6) =SEQ. ID. NO. 45 

(7) =SEQ. ID. NO. 46 

(8) =SEQ. ID. NO. 47 

(9) =SEQ. ID. NO. 48 

(10) =SEQ. ID. NO. 49 
(1J)=SEQ. ID. NO. 50 

(12) =SEQ. ID. NO. 51 
(12A)=SEQ. ID. NO. 52 

(13) =SEQ. ID. NO. 53 

(14) =SEQ. ID. NO. 54 .. . . ~ 

(15) =SEQ. ID. NO. 55 
(16>SEQ. ID. NO. 56 

(17) =SEQ. ID. NO. 57 

(18) =SEQ. ID. NO. 58 

The 40 primers listed as *\l) n through "(40)" in Table 6 
have also been individually and sequentially listed in the 
Sequence Listing beginning with SEQ. ID. NO. 59 through 
SEQ. ID. NO. 98. respectively. 

b. Preparation of a Repertoire of Genes Encoding Immuno- 
globulin Variable Domain 

NitrophenylphosrAonaimdate (NPN) was selected as the 
ligaod fox receptor binding in preparing a hetexodimeric 
receptor according to the methods of the invention. 

Keyhole limpet hemocyanin (KLH) was conjugated to 
NPN to form a NPN-KLH conjugate used for i mmunizin g a 
mouse to produce an anti-NPN immune response and 
thereby provide a source of ligand specific hetexodimeric 
receptor genes. 
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The NPN-KLH conjugate was prepared by adxnixing 250 
ul of a solution containing 2.5 mg of NPN in dimethylfor- 
mamide with 750 ul of a solution containing 2 mg of KLH 
in 0.01 Molar (M) sodium phosphate buffer (pH 7.2). The 
two solutions were admixed by slow addition of the NPN 
solution to the KLH solution while the KLH solution was 
being agitated by a rotating stirring bar. Thereafter the 
admixture was maintained at 4° C. for 1 hour with the same 
agitation to allow conjugation to proceed. The conjugated 
40 NPN-KLH was isolated from the oonconjugated NPN and 
KLH by gel filtration through Sephadex G-25. The isolated 
NPN-KLH conjugate was injected into mice as described 
below. 

The NPN-KLH conjugate was prepared for injection into 

45 mice by adding 100 ug of the conjugate to 250 ul of 
phosphate buffered saline (PBS). An equal volume of com- 
plete Fecund* s adjuvant was added and emulsified the entire 
solution for 5 minutes. A 129 G,* + mouse was injected with 
300 ul of the emulsion. Injections were given subcutane- 

50 ously at several sites using a 21 gauge needle. A second 
immunization with NPN-KLH was given two weeks later. 
This injection was prepared as follows: 50 micrograms (ug) 
of NPN-KLH were diluted in 250 ul of PBS and an equal 
volume of alum was admixed to the NPN-KLH solution. 

55 The mouse was injected intraperitoneally with 500 ul of the 
solution using a 23 gauge needle. One month later the mice 
were given a final injection of 50 ug of the NPN-KLH 
conjugate diluted to 200 ul in PBS. This injection was given 
intravenously in the lateral tail vein using a 30 gauge needle. 

60 Five days after this final injection the mice were sacrificed 
and total cellular RNA was isolated from their spleens. 

Total cellular RNA was prepared from the spleen of a 
single mouse immunized with KLH-NPN as described 
above using the RNA preparation methods described by 

65 Chomczynski et al.. Anal BiochtnL, 162:156-159 (1987) 
and using the RNA isolation kit (Strata gene) according to 
the manufacturer's instructions. Briefly, immediately after 
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removing the spleen from the immunized mouse, the tissue polynucleotides, i.e.. greater than about 10 4 different 

was homogenized in 10 ml of a denaturing solution con- V^-coding genes, and contains a similar number of 

taining 4.0M guanine isothiocyanate. 0.25M sodium citrate V L -coding genes. Thus, the mRNA population represents a 

at pH 7.0. and 0.1M beta-mercaptoethanol using a glass repertoire of variable region-coding genes, 
homogenizer. One ml of sodium acetate at a concentration of 5 c . Preparation of DNA Homologs 

2M at pH 4.0 was admixed with the homogenized spleen. preparation for PGR amplification. mRNA prepared 

One ml of phenol that had been previously saturated with abovc is uscd as a template for cDNA synthesis by a primer 

H 2 0 was also admixed to the denaturing solution containing extension reaction. In a typical 50 pi transcription reaction, 

the homogenized spleen. Two ml of a cMorofarmasoamyl ug ^ j^j^ ^ watcr i s first hybridized 
alcohol (24:1 v/v) mixture was added to this homogenate 1Q (anncaled) witn 500 og (50 0 pmol) of the 3' V„ primer 

n,e homogenate was mixed vigorously for ten seconds and ef J2A TafeIe ^ 65 o c fa fiyc minutcs< 

nuuntaine^nice^ Subsequently, the mixture is adjusted to 1.5 mM dXTP. 

T*¥?£ £ \ ^^i^^SSS^T^ dCTP. dGTP and oTI?. 40 mM Tris-HCL pH 8.0. 8 mM 

The upper RNA-contlining aqueous layer was transferred to « Murine Le ukernia vunis Reverse transcriptase (StraUgene). 

a fresh 50 ml polypropylene centrifuge tube and mixed with 26 units, is added and the solution is maintained for 1 hour 

an equal volume of isopropyl alcohol. This solution was at 37 C 

maintained at -20° C. for at least one hour to precipitate the PCR amplification is performed in a 100 pi reaction 

RNA. The solution containing the precipitated RNA was containing the products of the reverse transcription reaction 

centrifuged at lO.OOOxg for twenty minutes at 4° C. The 20 (approximately 5 ug of the cDNA/RNA hybrid). 300 ng of 

pelleted total cellular RNA was collected and dissolved in 3 3' V H primer (primer 12A of Table 5). 300 ng each of the 5' 

ml of the denaturing solution described above. Three ml of V„ primers (primers 2-10 of Table 5) 200 mM of a mixture 

isopropyl alcohol was added to the re-suspended total eel- of dNTP's. 50 mM KC1. 10 mM Tris-HCl pH 8.3. 15 mM 

lular RNA and vigorously mixed. This solution was main- MgCl 2 . 0.1% gelatin and 2 units of Thermus aquatic us (Taq) 

tained at -20° C. for at least 1 hour to precipitate the RNA. 25 DNA polymerase. The reaction mixture is overlaid with 

The solution containing the precipitated RNA was centri- mineral oil and subjected to 40 cycles of amplification. Each 

fuged at lO.OOOxg for ten minutes at 4° C The pelleted RNA amplification cycle includes denaturation at 92° C for 1 

was washed once with a solution containing 75% ethanoL minute, annealing at 52° C. for 2 minutes and polynucleotide 

The pelleted RNA was dried under vacuum for 15 minutes synthesis by primer extension (elongation) at 72° C. for 1.5 

and then rc-suspended in dimethyl pyrocarbonate (DEPQ 30 minutes. The amplified V coding DNA homolpg contain- 

treated (DEPC-H 2 0) H 2 0. ing samples arc then extracted twice with phenol/ 

Messenger RNA (mRNA) enriched for sequences con- chloroform, once with chloroform, ethanol precipitated and 

taining long poly A tracts was prepared from the total arc stored at -70° C. in 10 mM Tris-HCl, pH 7.5. and 1 mM 

cellular RNA using methods described in Molecular Clon- EDTA. 

ing; A Laboratory ManuaLM2^tisctaX..eds.. Cold Spring 35 Using unique 5* primers (2-9. Table 5). efficient 

Harbor, N.Y.. (1982). Briefly, one half of the total RNA V^-coding DNAhomolog synthesis and amplification from 

isolated from a single immunized mouse spleen prepared as the spleen mRNA is achieved as shown by agarose gel 

descried above was re-suspended in one ml of DEPC-H 2 0 electrophoresis. The amplified cDNA (V^ coding DNA 

and maintained at 65° C. for five minutes. One ml of 2xhigh homolog) was seen as a major band of the expected size (360 

salt loading buffer consisting of 100 mM Tris-HCl (Tris 40 bp). The amount the amplified V^-coding polynucleotide 

[hydroxymethyl] amino methane hydrochloride), 1M fragment in each reaction is similar, indicating that all of 

sodium chloride (NaCl). 2.0 mM disodium ethylene diamine these primers were about equally efficient in initiating 

terra-acetic acid (EDTA) at pH 7.5. and 0.2% sodium amplification.The yielded quality of the amplification with 

dodecyl sulfate (SDS) was added to the re-suspended RNA these primers is reproducible. 

and the mixture allowed to cool to room temperature. The 45 The primer containing inosine also synthesizes amplified 
mixture was then applied to an oUgo-dT (Collaborative V w -coding DNA homologs from spleen mRNA 
Research Type 2 or Type 3) column that was previously reproducibly. leading to the production of the expected sized 
prepared by washing the oligo-dT with a solution containing fragment, of an intensity s imilar to that of the other ampli- 
0.1M sodium hydroxide and 5 mM EDTA and then equili- fied cDNAs. The presence of inosine also permits efficient 
brating the column with DEPC-H 2 0. The eluate was col- 50 DNA homolog synthesis and amplification, clearly indicat- 
lected in a sterile polypropylene tube and reapplied to the ing that such primers are useful in generating a plurality of 
same column after heating the eluate for 5 minutes at 65° C. V„-coding DNA homologs. Amplification products 
The oligo dT column was then washed with 2 ml of high salt obtained from the constant region primers (primers 1 1 and 
loading buffer consisting of 50 mM Iris-HQ, pH 7.5. 500 13, Table 5) are more intense indicating that amplification 
mM sodium chloride. 1 mM EDTA at pH 7.5 and 0.1% SDS. 55 was more efficient, possibly because of a higher degree of 
The oligo dT column was then washed with 2 ml of homology between the template and primers. Following the 
Ixmedium salt buffer consisting of 50 mM Tris-HCL pH 7.5, above procedures, a V^coding gene library is constructed 
100 mM. 1 mM EDTA and 0. 1% SDS. The messenger RNA from the products of eight amplifications, each performed 
was eluted from the oligo dT column with 1 ml of buffer with a different 5 T primer. Equal portions of the products 
consisting of 10 mM Tris-HCl. pH 7 J, 1 mM EDTA, at pH 60 from each primer extension reaction are mixed and the 
7.5. and 0.05% SDS. The messenger RNA was purified by mixed product is then used to generate a librairy of 
extracting this solution with phenol/chloroform followed by V^-coding DNA homolog-containing vectors, 
a single extraction with 100% chloroform. The messenger DNA homologs of the V L are also prepared from the 
RNA was concentrated by ethanol precipitation and purified mRNA prepared as described above. In preparation 
re-suspended in DEPC H 2 0. 65 for PCR amplification. mRNA prepared according to the 
The messenger RNA (mRNA) isolated by the above above examples is used as a template for cDNA synthesis- 
process contains a plurality of different V w coding In a typical 50 pi transcription reaction, 5-10 ug of spleen 
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mRNA in water is first annealed with 300 ng (50.0 pmol) of prepared V„ DNA homologs arc then directly inserted into 

the 3' V primer (primer 14. Table 5). at 65° C. for five the Xho I and Spe I restriction digested Lambda Hc2 

minutes ^Subsequently, the mixtiire is adjusted to 1.5 mM expression vector that prepared above by ligating 3 moles of 

dATP dCTP dGTP. and dTTP. 40 mM Tris-HCI. pH 8.0. 8 V„ DNA homolog inserts with each mole of the Hc2 

rnM MgCl 50 mM NaCl. and 2 mM spermidine. Moloney- 5 expression vector overnight at 5° G Approximately 3 .Ox 10 3 

Murine Leukemia virus reverse transcriptase (Stratagene). plague forming units are obtained after packaging the^ DNA 

26 units is added and the solution is maintained for 1 hour with Gigapack H Bold (Stratagene) of which 50% are 

at 37° C The PCR amplification is performed in a 100 ul recombinants. The ligation mixture containing the V„ DNA 

reaction containing approximately 5 ug of the cDNA/RNA homologs arc packaged according to the manufacturers 

hybrid produced as described above. 300 ng of the J io specifications using Gigapack Gold U Packing Extract 

primer (primer 14 of Table 5), 300 ng of the 5' V,. primer (Stratagene). The resulting Lambda Hc2 expression libraries 

(primer 16 of Table 5). 200 mM of a mixture of dNTP's. 50 are then transformed into XLl-Blue cells. 

mM KG 10 mM Tris-HQ, pH 83. 15 mM MgCl* 0.1% To prepare a library enriched in V,. sequences. PCR 

gelatin and 2 units of Taq DNA polymerase. The reaction amplified products enriched in sequences are prepared 

mixture is overlaid with mineral oil and subjected to 40 15 according to Example 2c. These DNA homologs are 

cycles of amplification. Each amplification cycle includes digested with restriction enzymes Sac I and Aba I and the 

denaturationat 92° C for 1 minute, annealing at 52° G for digested V L DNA homologs are purified on a 1% agarose gel 

2 minutes and elongation at 72° C for 1.5 minutes. The as described above for the V„ DNA homologs to form a 

amplified samples are then extracted twice with phenol/ repertoire of V r polypeptide genes adapted for directional 

chloroform, once with chloroform, ethanol precipitated and 20 Ugation. The prepared V L DNA homologs are then direc- 

are stored at -70° C in 10 mM Tris-HQ. 7.5 and 1 mM tionally ligated into the Lambda Lc2 expression vector 

EI y rA previously digested with the restriction enzymes. Sac I and 

d. Insertion of DNA Homologs into a DNA Expression Xba I as described for Lambda Hc2. The ligation mixture 

Vcctor containing the V t DNA homologs is packaged to form a 

To prepare an expression library enriched in V„ 25 Lambda Lc2 expression library as described above and is 

sequences. DNA homologs enriched in V H sequences are ready to be plated on XLl-Blue cells, 

prepared according to Example 2c using the same set of 5* e. Randomly Coinbining V„ and V,, DNA Homologs on the 

primers but with primer 12A (Table 5) as the 3' primer. The Same Expression Vector 

resulting PCR amplified products (2.5 ug/30 ui of 150 mM The construction of a library containing vectors for 

NaO 8 mM Tris-HQ. pH 7.5, 6 mM MgS0 4 , 1 mM DTT, 30 expressing two cistrons that express heavy and light chains 

200 ug/ml BSA) are digested at 37° C with restriction is accomplished in two steps. In the first step, separate heavy 

enzymes Xho I (125 units) and Spe I (125 units). In cloning and light chain libraries are constructed in the expression 

experiments which required a mixture of the products of the vectors Lambda Hc2 and Lambda Lc2 respectively, as 

amplification reactions, equal volumes (50 uL 1-10 ug described using gene repertoires obtained from a mouse 

concentration) of each reaction mixture are combined after 35 immunized with NPN-KLH. In the second step, these two 

amplification but before restriction digestion. The V w libraries are combined at the anhsymmetncEcoR I sites 

homologs are purified on a 1% agarose gel using the present in each vector. This resulted in a library of clones 

standard electro-elution technique described in MoUcular each of which potentially co-expresses a heavy and a ligfrt 

Cloning A Laboratory Manual. Maniatis et aL. eds.. Cold chain. The actual combinations are random and do not 

Sr^gHaitw.N.Y..(1982).Artogelelecti^ca-esisofthe 40 necessarily reflect the combinations present in the B-cell 

digested PCR amplified spleen mRNA. the region of die gel population in the parent animal 

containing DNA fragments of approximate 350 bps is The spleen mRNA resulting from the above immuniza- 
excised. electro-eluted into a dialysis membrane, ethanol tions (Example 2b) is isolated and used to create a primary 
precipitated and re-suspended suspended in a TE solution library of V„ gene sequences using the lambda Hc2 expres- 
containing 10 mM Tris-HCI. pH 7.5 and 1 mM EDTA to a 45 sion vector. The primary library contains 1 3x10 plaque- 
final concentration of 50 ng/ul. The resulting V„ DNA forming units (pfu) and can be screened for the expression 
homologs represent a repertoire of polypeptide genes having of the decapeptide tag to determine the percentage of clones 
cohesive tennini adapted for directional ligation to the expressing V„ and C„l (Fd) sequences. The sequence for 
vector Lambda Hc2. These prepared V„ DNAhomologs are this peptide is only in frame for expression following the 
then directly inserted by directional ligation into linearized 50 cloning of a Fd (or V w ) fragment into the vector. At least 
Lambda Hc2 expression vector prepared as described below. 80% of the clones in the library express Fd fragments based 

The Lambda Hc2 expression DNA vector is prepared for on irnmunodetection of the decapeptide tag. 

inserting a DNA homolog by afcnixing 100 ug of this DNA The light chain library is constructed^ the same way as 

to a solution containing 250 units each of the restriction the heavy chain and contains 2.5x10 members. Plaque 

endonucleases Xho I and Spe I (both from Boehringer 55 screening, using an anti-kappa chain antir^ody.indicates that 

Mannheim. Indianapolis. Ind.) and a buffer recommended 60% of the library contained express light chain inserts. A 

by the manufacturer. This solution is maintained at 37 from small percentage of inserts results from incomplete dqjos- 

1 5 hours The solution is heated at 65° C. for 15 minutes top phorylation of vector after cleavage with Sac I and Xba L 

inactivate the restriction endonucleases. The solution is Once obtained, the two libraries arc used to construct a 

chilled to 30° C and 25 units of beat-lrillable (HK) phos- 60 combinatorial library by crossing them at the EcoR I site. To 

phatase (Epicenter. Madison, Wis.) and CaQ 2 is admixed to accomplish the cross, DNA is first purified from each hteary. 

it according to the manufacturer's specifications. This solu- The Lambda Lc2 library prepared in Example 2d is 

tion is maintained at 30° G for 1 hour. The DNA is purified amplified and 500 ug of Umbda Lc2 expression library 

by extracting the solution with a mixture of phenol and phage DNA is prepared from the amplified phage stock 

chloroform followed by ethanol predpiution. The Lambda 65 using the procedures described in Molecular Cloning: A 
Hc2 expression vector is now ready for ligation to the V„ Laboratory Manual. Maniatis et al., eds.. Cold Spring 
DNA homologs prepared in the above examples. These Harbor. N.Y (1982). Fifty ug of this amplified expression 
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library phage DNA is maintained in a solution containing 
100 units of MLu I restriction endonuclease (Boehringer 
Mannheim. Indianapolis. Ind.) in 200 ul of a buffer supplied 
by the endonuclease manufacturer for 1.5 hours at 37° C. 
The solution is then extracted with a mixture of phenol and 
chloroform. The DNA is then ethanol precipitated and 
re-suspended in 100 ul of water. This solution is admixed 
with 100 units of the restriction endonuclease EcoR I 
(Boehringer) in a final volume of 200 ul of buffer containing 
the components specified by the manufacturer. This solution 
is maintained at 37° C. for 1.5 hours and the solution is then 
extracted with a mixture of phenol and chloroform. The 
DNA was ethanol precipitated and the DNA re-suspended in 

The Lambda Hc2 expression library prepared in Example 
2d is amplified and 500 ug of Lambda Hc2 expression 
library phage DNA is prepared using the -methods detailed 
above. 50 ug of this amplified library phage DNA is main- 
tained in a solution containing 100 units of Hind III restric- 
tion endonuclease (Boehringer) in 200 ul of a buffer supplied 
by the endonuclease manufacturer for 1.5 hours at 37° C. 
TTie solution is then extracted with a mixture of phenol and 
chloroform saturated with 0. 1M Tris-HCl. pH 7.5. The DNA 
is then ethanol precipitated and re-suspended in 100 ul of 
water. This solution is admixed with 100 units of the 
restriction endonuclease EcoR I (Boehringer) in a final 
volume of 200 ul of buffer containing the components 
specified by the manufacturer. This solution is maintained at 
37° C. for 1.5 hours and the solution is then extracted with 
a mixture of phenol and chloroform. The DNA is ethanol 
precipitated and the DNA re-suspended in TE. 

The restriction digested Hc2 and Lc2 expression libraries 
are ligated together. To that end. a DNA admixture consists 
of 1 ug of Hc2 and 1 ug of Lc2 phage library DNA is 
prepared in a 10 ul reaction using the reagents supplied in a 
ligation kit (Stratagene). The DNA admixture is warmed to 
45° C. for 5 minutes to melt any cohesive terrnini that may 
have reannealed. The admixture is then chilled to 0° C. to 
prevent religation. Bacteriophage T4 DNAligase (0.1 Weiss 
units which is equivalent to 0.02 units as determined in an 
exonuclease resistance assay) is admixed into the chilled 
DNA solution along with 1 ul of 5 mM ATP and 1 ul 
lOxbacteriophage T4 DNA ligase buffer (lOxbuffer is pre- 
pared by admixing 200 mM Tris-HCl. pH 7.6. 50 mM 
Mgd 2 .50 mM DTT. and 500 ug/ml BSA) to form a ligation 
admixture. After ligation for 16 hr at 4° C. 1 pi of the ligated 
the phage DNA is packaged with Gigapack Gold H pack- 
aging extract and plated on XL 1 -Blue cells prepared accord- 
ing to the manufacturers instructions to form a Lambda 
phage library of dicistronic expression vectors capable of 
expressing heavy and light chains derived from the NPN- 
immunized mouse. A portion of the clones obtained are used 
to determine the effectiveness of the combination, 
f. Selection of Anti-NPN Reactive Heterodiroer-Producing 
Dicistronic Vectors 

The combinatorial Fab expression library prepared above 
in Example 2a was screened to identify clones having 
affinity for NPN. To determine the frequency of the phage 
clones which co-expressed the light and heavy chain 
fragments* duplicate lifts of the light chain, heavy chain and 
combinatorial libraries were screened as above for light and 
heavy chain expression. In this study of approximately 500 
recombinant phage, approximately 60% co-expressed light 
and heavy chain proteins. 

All three libraries, the light chain, the heavy chain and the 
combinatorial, were screened to determine if they contained 
recombinant phage that expressed antibody fragments which 
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bound NPN. In a typical procedure 30.000 phage were 
plated on XLl-Blue cells and duplicate lifts with nitrocel- 
lulose were screened for binding to NPN coupled to ,25 I 
labeled BSA. The BSA was iodinated following the 

5 Chloramine-T method as described by Bolton et al.. 
Biochem,, 133:529-534 (1973). Duplicate screens of 80.000 
recombinant phage from the light chain library and a similar 
number from the heavy chain library did not identify any 
clones which bound the antigen. In contrast, the screen of a 

10 similar number of clones from the Fab expression library 
identified many phage plaques that bound NPN. This obser- 
vation indicates that under conditions where many heavy 
chains in combination with light chains bind to antigen the 
same heavy or light chains alone do not Therefore, in the 

15 case of NPN. it is believed that there are many heavy and 
light chains that only bind antigen when they are combined 
with specific light and heavy chains respectively. 

To assess the ability to screen large numbers of clones and 
obtain a more quantitative estimate of the frequency of 

20 antigen binding clones in the combinatorial library, one 
million phage plaques were screened and apr*oximately 100 
clones which bound to antigen were identified. For six 
clones which were believed to bind NPN. a region of the 
plate containing the six positive and approximately 20 

25 surrounding bacteriophage plaques was selected and each 
plaque was cored, replated. and screened with duplicate lifts. 
As expected, approximately one in twenty of the phage 
specifically bound to antigen. Cores of regions of the plated 
phage believed to be negative did not give positives on 

30 replating. 

Clone 2b. one of the plaques which reacted with NPN. 
was excised according to an in vivo excision protocol where 
200 ul of phage stock and 200 ui of a F+ derivative of 
XLl-Blue (A flOO =1.00) (Stratagene) were admixed with 1 ul 

35 of M13mp8 helper phage (lxlO 10 pfu/milliliter (ml)) and 
maintained at 37° C. for 15 minutes. After a four hour 
maintenance in Luria-Bertani (LB) medium and heating at 
70° C. for 20 minutes to heat kill the XLl-Blue cells, the 
phagemids were re-infected into XLl-Blue cells and plated 

40 onto LB plates containing ampicillin. This procedure con- 
verted the cloned insert from the Lambda Zap n vector into 
a plasmid vector to allow easy manipulation and sequencing 
(Stratagene). The phagemid DNA encoding the and part 
of the was then determined by DNA sequencing using the 

45 Sanger dideoxy method described in Sanger et al.. Proc. 
Natl Acad. Scl t 74:5463-5467 (1977) using a Sequenase kit 
according to manufacturer's instructions (US Biochemical 
Corp.. Cleveland. Ohio). The nucleotide residue sequence of 
Clone 2b Fd chain is listed in the Sequence Listing as SEQ. 

50 ID. NO. 99. The nucleotide residue sequences of the kappa 
light chain variable and constant regions are listed in the 
Sequence Listing as SEQ. ID. NO. 100 and SEQ. ID. NO. 
101, respectively. 

g. Preparation of a DNA Sequence Encoding a Filamentous 

55 Phage Coat Protein Membrane Anchor 

cpVm Membrane Anchor M13mpl8. a commercially 
available bacteriophage vector (Pharmacia, Piscataway. 
N J.), was used as a source for isolating the gene encoding 
cpVTtt The sequence of the gene encoding the membrane 

60 anchor domain of cpVTTJ listed in Sequence Listing as SEQ. 
ID. NO. 102. was modified through PGR amplification to 
incorporate the restriction endonuclease sites. Spe I and 
EcoR L and two stop codons prior to the EcoR I site. The 
corresponding amino acid residue sequence of the mem- 

65 brane anchor domain of cpVm is listed as SEQ. ID. NO. 17. 
To prepare a modified cpVUJ, replicative form DNA from 
M13mpl8 was first isolated. Briefly, into 2 ml of LB 
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(Luria-Bertani medium). 50 ul of a culture of a bacterial 
strain carrying an F episome (JM107. JM109 or TGI) was 
admixed with a one tenth suspension of bacteriophage 
particles derived from a single plaque. The admixture was 
incubated for 4 to 5 hours at 37° C. with constant agitation. 
The admixture was then centrifuged at 12.000xg for 5 
minutes to pellet the infected bacteria. After the supernatant 
was removed, the pellet was resuspended by vigorous vor- 
texing in 100 ul of ice-cold solution i Solution I was 
prepared by admixing 50 roM glucose. 10 mM EDTA and 25 
mM Tris-HQ. pH 8.0, and autociaving for 15 minutes. 

To the bacterial suspension. 200 ul of freshly prepared 
Solution II was admixed and the tube was rapidly inverted 
five times. Solution II was prepared by admixing 0.2N 
NaOH and 1% SDS. To the bacterial suspension, 150 ul of 
ice-cold Solution III was admixed and the tube was vortexed 
gently in an inverted position for 10 seconds for to disperse 
Solution m through the viscous bacterial lysate. Solution III 
was prepared by admixing 60 ml of 5M potassium acetate, 
11.5 ml of glacial acetic acid and 28.5 ml of water. The 
resultant bacterial lysate was then stored on ice for 5 minutes 
followed by centrifugation at 12,000xg for 5 minutes at 4° 
C in a microfuge. The resultant supernatant was recovered 
and transferred to a new tube. To the supernatant was added 
an equal volume of phenol: chloroform and the admixture 
was vortexed. The admixture was then centrifuged at 
12.000xg for 2 minutes in a microfuge. The resultant super- 
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of which are listed in Table 7 below, were used in the PCR 
reaction to amplify the mature gene for cpVTH member 
anchor domain and incorporate the two cloning sites. Spe I 
and EcoR t For the PCR reaction. 2 ul containing 1 

5 nanogram (ng) of M13mpl8 replicative form DNA was 
admixed with 10 ul of IQxPCR buffer purchased commer- 
cially (Promega Biotech. Madison. Wis.) in a 0.5 mi 
microfuge tube. To the DNA admixture, 8 ul of a 2.5 mM 

10 solution of dNTPs (dATP. dCTP, dGTP. dTTP) was admixed 
to result in a final concentration of 200 micromolar (uM). 
Three ul (equivalent to 60 picomoles (pM)) of the 5* forward 
AK 5 primer and 3 ul (60 pM) of the 3' backward AK 6 
primer was admixed into the DNA solution. To the 

13 admixture. 73 ul of sterile water and 1 ul/5 units of poly- 
merase (Promega Biotech) was added. Two drops of mineral 
oil were placed on top of the admixture and 40 rounds of 
PCR amplification in a thermocycler were performed. The 
amplification cycle consisted of 52° C. for 2 minutes. 72° C. 

20 for 1.5 minutes and 91° C. for 2 minutes. The resultant PCR 
modified cpVTH membrane anchor domain DNA fragment 
from M13mpl8 containing samples were then purified with 
Gene Clean (BIO101. La Jolla, Calif.), extracted twice with 

25 phenol/chlorof onru once with chloroform followed by etha- 
nol precipitation and were stored at -70° C. in 10 mM 
Tris-HCl. pH 7.5. and 1 mM EDTA. 



TABLE 7 



SEQ. 

ID NO. Primer 

(103) 1 AK 5 (F) 5" GTGCX^CAGGGATTtjrACTAGTOCTGAGGGTGACGAr T 

(104) 2 AK 6 (B) 5 ACJlL TJAAi IL lAlLAi^ TITDC l l lLUAOUlUAA i 

(105) 3 Hc3 (F) 5 AOOTCCAGCTTCTCGAGTtrrGG 3' 

(106) 4 AK 7 (B) 5* GTCACOCTCAGCACTAGTACAAICCXrrGGGCAC 3" 
(I07f G-3 fF) 5" UAUAU UAlTarjKXjTGCXJ^ 

O'UllilGAAXKIXJAA J 

(108) * G-3 (B) y 1 IACLAGC 1 AUCAIAA IAACGGAATACCCAAAA 

GAACTOG 3' ' 

(109) 7 LAC-F y JLAiUC 1ACJCIAGTAAC AC G ACAOGTTTCCC GAC 

1UU s 

(110) ° LAC-B y ACCOAGCTC GAAITC GTAArCXrtXnC 3' 

F Forward Primer 
B Bcckward Primer 

'From y to 3': the overlapping sequence for C„l 3' cod is double under Dried: the Spe 1 
restriction site sequence is single underlined; the overlapping sequence for cpVm is double 
underlined. 

3 EcoR I restriction site sequence is single underlined 
■*Xho I restriction site sequence is underlined 

^From y to 3': the overlapping sequence for cpVm is double underlined; the Spe I restriction 
site sequence is single underlined; the overlapping sequence for C H 1 3' end is double 
underlined 

^Froco 5' to 3*: Spe I restriction site sequence is single underlined; the overtyping sequence 
with die y end of cpm is double underlined 

*FTOtn y to 3': Nhe I restriction site sequence is single imdertincd; the overlapping sequence 
with 3' end of cpITJ is double underlined. 

"Trotn y oo 3*: overlapping sequence with the 3* end of cpffj is double underlined; Nhe I 
restriction sequence begins with the nucleotide residue XT at position 4 and extends 5 more 
residues = GCTAGC. 

B EcoR I restriction site sequence is single imrVrlmrd 



natant was transferred to a new tube and the double-stranded To verify amplification of the modified cpVin membrane 

bacteriophage DNA was precipitated with 2 volumes of anchor domain, the PCR purified DNA products were elec- 

ethanol at room ternperature. After allowing the admixture trophoresed in a 1% agarose geL The expected size of the 

to stand at room temperature for 2 minutes, the admixture ^ was appr03dmate ly 150 base pairs. The area in the 

was centrifuged to pellet the DNA. The supernatant was r t . . . J , .JU .^xta ^ 

■ ^ ,f ^ . . c T^Zr a agarose containing the modified cpVTU DNA fragment was 

removed and the pelleted replicative form DNA was resus- ^^ w UW iuui 6 iu ^ ? 

pended in 25 ul ofTCs-HCZ pH 7.6. and 10 mM EDTA (TE). isolated from as described above. The sequence 
The double-stranded M13mpl8 replicative form DNA 65 of the isolated modified cpVm DNA fragment is listed as 

was then used as a template for PCR. Primers. AK 5 (SEQ. SEQ. ID. NO. 111. The isolated cpVffl DNA fragment was 

ID. NO. 103) and AK 6 (SEQ. ID. NO. 104), the sequences then admixed with a similarly prepared fragment of modi- 
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fied Fd as described below in Example 2i in order to form 
a DNA segment encoding the fusion protein Fd-cpVTJl 

cpHT Membrane Anchor: M13mpl8 was also used as a 
source for isolating the gene encoding the membrane anchor 
domain at cpIH, the sequence of which is listed in the 
Sequence Listing as SEQ. ID. NO. 112. The amino acid 
residue sequence of membrane anchor domain cpm is listed 
in SEQ. H). NO. 16. M13mpl8 replicative form DNA was 
prepared as described above and used as a template for PCR 
for amplifying the mature gene for cpHI membrane anchor 
domain and incorporating the two cloning sites. Spe I and 
EcoR I. 

The primer pair. G-3(F) (SEQ. ID. NO. 107) and G-3(B) 
(SEQ. ID. NO. 108) listed in Table 7. was used in PCR as 
performed above to incorporate Spe I and Nhe I restriction 
sites. The resultant PCR modified cpHI DNA fragment was 
verified and purified as described above. The sequence of the 
PCR modified cpHI membrane anchor domain DNA frag- 
ment is listed in the Sequence Listing as SEQ. ID. NO. 113. 
A second PCR amplification using the primer pairs, Lac-F 
(SEQ. ID. NO- 109) and Lac-B (SEQ. ID. NO. 110) listed 
in Table 7. was performed on a separate aliquot of M13mpl8 
replicative form template DNA. The primers used for this 
amplification were designed to incorporate an overlapping 
sequence with the nucleotides encoding the membrane 
anchor region of cpIIL and the adjacent Nhe I site along with 
a sequence encoding a LacZ promoter region 5* to an EcoR 
I restriction site. The reaction and purification of the PCR 
product performed as described above. The sequence of the 
resultant PCR modified cpDI DNA fragment having Nho I 
and EcoR I restriction sites is listed in the Sequence Listing 
as SEQ. ID. NO. 114. 

The products of the first and second PCR amplifications 
were then recombined at the nucleotides corresponding to 
cpm membrane anchor overlap and Nhe I restriction site 
and subjected to a second round of PCR using the G3-F 
(SEQ. ID. NO. 1(77) and Lac-B (SEQ. ID. NO. 110) primer 
pair to form a recombined PGR DNA fragment product 
consisting of the following: a 5' Spe I restriction site; a cpDI 
DNA membrane anchor domain beginning at the nucleotide 
residue sequence which corresponds to the amino acid 
residue 198 of the entire mature cpHI protein; an endog- 
enous stop site provided by the membrane anchor at amino 
acid residue number 112; a Nhe I restriction site, a LacZ 
promoter sequence; and a 3* EcoR I restriction site. The 
recombined PCR modified cpm membrane anchor domain 
DNA fragment was then restriction digested with Spe I and 
EcoR I to produce a DNA fragment for directional ligation 
into a pComb phagemid expression vector prepared in 
Example la(iv) and to form a pCombUI phagemid expres- 
sion vector as described in Example lb(ii). 

h. Isolation of Anti-NPN Coding V w DNA Segment 

To prepare modified Fd fragments for recombination with 
the PCR modified cpVUI membrane anchor domain frag- 
ment to form a Fd-cpVUI DNA fusion product PCR ampli- 
fication as described above was performed using Done 2b. 
prepared in Example 2f, as a template. The primers. Hc3 
(SEQ. ID. NO. 105) and AK 7 (SEQ. ID. NO. 106), the 
sequences of which are listed in Table 7. were used in PCR 
to amplify the Fd portion of the Gone 2b and incorporate 
Xho I and Spe I cloning sites along with a cpVTH overlap- 
ping sequence. Hie amplified PCR modified Fd product was 
purified, electrophoresed and isolated from 1% agarose gels 
as described above. The size of the Fd fragment was 680 
base pairs. 

i. Preparation of a DNA Segment Encoding a Portion of the 
Fusion Protein Fd-cpVUI 



56 

The purified PCR modified Fd DNA fragment containing 
cpVUJ overlapping nucleotide sequences prepared above 
was then admixed with the PCR modified cpVTH membrane 
anchor domain fragment to form an admixture. The frag- 

5 ments in the admixture were allowed to recombine at their 
complementary regions. The admixture containing the 
recombined PCR fragments was then subjected to a second 
round of PCR amplification as described above using the 
end primer pair AK 6 (SEQ. ID. NO. 104) and Hc3 (SEQ. 

to ID. NO. 105) (Table 7). The corresponding product of the 
PCR amplification was purified and electrophoresed on 
agarose gels as described above. The PCR product was 
determined to be approximately 830 base pairs (Fd=680+ 
150) confirming the fusion of Fd with cpVUI. The sequence 

15 of the PCR product linking the Fd sequence with the cpVIH 
sequence in frame in a 5* to 3' direction is listed as SEQ. ID. 
NO. 115. The Fd-cpVUI fusion product was then used in 
directional ligations described in Example 2j for the con- 
struction of a pCBAK8-2b dicistronic phagemid expression 

20 vector. 

j. Construction of pCBAK8-2b Dicistronic Expression Vec- 
tor 

To construct a phagemid vector for the coordinate expres- 
sion of a Fd-cpvni fusion protein with kappa light chain, the 

25 PCR amplified Fd-cpVUI fusion product prepared in above 
in Example 21 was first ligated into Clone 2b phagemid 
expression vector isolated from the NPN combinatorial 
library prepared in Example 2f. For the ligation, the 
Fd-cpvni PCR fusion product was first restriction digested 

30 with Xho I and EcoR L Gone 2b phagemid vector was 
similarly digested resulting in the removal of the cloning and 
decapeptide regions. The digested Fd-cpVTH fragment was 
admixed and ligated into the digested Clone 2b at the 
cohesive termini generated by Xho I and EcoR I restriction 

35 digestion. The ligation resulted in operatively linking the 
nucleotide residue sequence encoding the Fd-cpVUI 
polypeptide fusion protein to a second cassette having the 
nucleotide residue sequences encoding the ribosome binding 
site, a pelB leader sequence and the kappa light chain 

40 already present in Gone 2b to form a dicistronic DNA 
molecule in the original Gone 2b phagemid expression 
vector. 

E. coli* strain TGI. was then transformed with the 
phagemid containing the dicistronic DNA molecule and 

45 transformants were selected on ampicillin as the original 
Clone 2b contained an ampicillin selectable resistance 
marker gene. For high efficiency electrc^transforrnation of 
£. coli. a 1:100 volume of an overnight culture of TGI cells 
was inoculated into one liter of L-broth (1% Bacto tryptooc, 

50 0.5% Bacto yeast extract 0.5% NaQ). The cell suspension 
was maintained at 37° C with vigorous shaking to a 
absorbance at 600 nm of 0.5 to 1.0. The cell suspension in 
log phase growth was then harvested by first chilling the 
flask on ice for 15 to 30 minutes followed by centrifugation 

55 in a cold rotor at 4000 g for 15 minutes to pellet the bacteria. 
The resultant supernatant was removed and the bacterial cell 
pellet was resuspended in a total of one liter of cold water 
to form a cell suspension. The centrifugation and resuspen- 
sion procedure was repeated two more times and after the 

60 final centrifugation. the cell pellet was resuspended in 20 ml 
of cold 10% glycerol. The resuspended cell suspension was 
then centrifuged to form a cell pellet. The resultant cell pellet 
was resuspended to a final volume of 2 to 3 ml in cold 10% 
glycerol resulting in a cell concentration of 1 to 3xl0 iO 

65 cells/mi For the electro o^nsformation procedure. 40 ul of 
the prepared cell suspension was admixed with 1 to 2 ul of 
phagemid DNA to form a cell-phagemid DNA admixture. 
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The resultant admixture was mixed and allowed to sit on ice 
for one minute. An electroporation apparatus, for example a 
Gene Pulsar, was set a 25 uF and 2.5 kV. The pulse controller 
was set to 200 ohms. The cell-DNA admixture was trans- 
ferred to a cold 0.2 cm electroporation cuvette. The cuvette 
was then placed in the chilled safety chamber and pulsed 
once at the above settings. To the pulsed admixture. 1 ml of 
SOC medium was then admixed and the cells were resus- 
pended with a Pasteur pipette (SOC medium was prepared 
by admixing 2% Bacto tryptone. 0.5% Bacto yeast extract. 
10 mM Nad. 2.5 roM KC1. 10 mM MgQ 2 . lOmM MgS0 4 . 
and 20 mM glucose). The cells suspension was then trans- 
ferred to a 17x100 mm polypropylene tube and maintained 
at 37° C. for one hour. After the maintenance period, the 
transformed TGI cells were then plated on ampicillin LB 
plates for selection of ampicillin resistant colonics contain- 
ing the phagemid which provided the selectable marker 
gene. 

Ampicillin resistant colonies were selected and analyzed 
for the correct insert size and expression of Fab. Briefly, 
DNA mini preps of selected colonies were prepared for the 
isolation of phagemid DNA. The isolated phagemid DNA 
from each niiniprep was restriction digested with Xho I and 
EcoR I and the digests were electrophoresed on a 1% 
agarose gel. Clone AK16 was selected as an 830 bp fragment 
was visualized on the gels corifirming the insertion of the 
Fd-cpVUI PCR fusion product into digested Clone 2b. 

Gone AK16 phagemid was then restriction digested with 
Xho I and Xba I and the nucleotide residue sequence of the 
dicistronic DNA molecule encoding the Fd-cpVTH fusion 
protein, the ribosome binding site and petB leader sequence 
for expression of the light chain, a spacer region and the 2b 
kappa light chain was isolated by agarose gel electrophore- 
sis. The isolated dicistronic DNA fragment was then li gated 
into a Xho I and Xba I restriction digested pCBAKO 
expression vector prepared in Example lc(ii) to form a 
dicistronic phagemid expression vector designated 
pCBAK8-2b. 

The resultant pCBAK8-2b expression vector consisted of 
nucleotide residue sequences encoding the following ele- 
ments: f 1 filamentous phage origin of replication; a chloram- 
phenicol acetyl transferase selectable resistance marker 
gene; an inducible LacZ promoter upstream from the LacZ 
gene; a multiple cloning site flanked by T3 and T7 poly- 
merase promoters; and the dicistronic DNA molecule (a first 
cassette consisting of a ribosome binding site, a pelB leader, 
and a Fd-cpVTH DNA fusion product operatively linked to 
a second cassette consisting of a second ribosome binding 
site, a second pelb leader, and a kappa light chain), 
k. Construction of pCBAJC3-2b Dicistronic Expression Vec- 
tor 

To construct a phagemid vector for the coordinate expres- 
sion of a Fd-cpm fusion protein with kappa light chain, the 
PGR amplified and recombined cpHI membrane anchor 
prepared in Example 2g having a 5* Spe I and 3* EcoR I 
restriction site was first directionally ligated into a pComb 
phagemid expression vector prepared in Example la(iv) to 
form a pCombUI phagemid vector. See Example lb(ii) for 
details of vector construction. The resultant pCombm 
phagemid vector was then restriction digested with Sac II 
and Apa I to form an isolated fragment. The resultant 
isolated fragment containing the expression control 
sequences and the cpHI sequence was then directionally 
ligated into a similarly digested pCBAKO phagemid vector 
prepared in Example lc(ii) to form a pCBAK3 phagemid 
expression vector. This vector lacked Fd and kappa light 
chain sequences. 
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A phagemid expression vector, pCBAK3-2b. for the 
expression of a fusion protein and kappa light chain was then 
constructed. Briefly, the pCBAK3 phagemid expression 
vector prepared above was first digested with Xho I and Spe 

5 I to form a linearized pCBAK3 phagemid expression vector. 
PCR amplified and modified Fd fragment prepared in 
Example 2h containing Xho I and Spe I sites, was subse- 
quently restriction digested with Xho I and Spe L The 
resultant Fd fragment was then directionally ligated via 

io cohesive termini into the Xho I and Spe I restriction digested 
pCBAK3 phagemid expression vector to form a second 
phagemid expression vector in which the PCR modified Fd 
fragment was operatively linked in-frame to nucleotide 
residue sequences encoding cpEL £. coli strain XLl-Blue 

15 (Stratagene) was then transformed with the above phagemid 
vector containing Fd-cpIH. Transform ants containing the 
Fd-cpHI encoding phagemid were selected on chloram- 
phenicol Phagemid DNA was isolated from chlorampheni- 
col resistant clones and was restriction digested with Sac I 

20 and Xba I to form a linearized phagemid expression vector 
into which a Sac I and Xba I light chain fragment prepared 
below was directionally ligated. 

Phagemid Clone 2b. isolated from the original combina- 
torial library as described in Example 2a. was restriction 

25 digested with Sac I and Xba- 1 to isolate the nucleotide 
residue sequence encoding the kappa light chain. The iso- 
lated kappa light chain sequence was then directionally 
ligated into the Sac I and Xba I restriction digested 
phagemid expression vector prepared above containing 

30 Fd-cpm to form the phagemid expression vector. pCBAK3- 
2b. The resultant vector contained the nucleotide residue 
sequence of a dicistronic DNA molecule for the coordinate 
expression of a Fd-cpm fusion protein with kappa light 
chain. The resultant phagemid expression vector consisted 

35 of nucleotide residue sequences encoding the following 
elements: fl filamentous phage origin of replication; a 
chloramphenicol acetyl transferase selectable resistance 
marker gene; an inducible LacZ promoter upstream from the 
LacZ gene; a multiple cloning site flanked by T3 and T7 

40 polymerase promoters; and the dicistronic molecule (a first 
cassette consisting of a first ribosome binding site and pelB 
leader operatively linked to Fd-cpm operatively linked to a 
second cassette consisting of a second LacZ. a second 
ribosome binding site, and a second pelB leader operatively 

45 linked to a kappa light chain). 

XLl-Blue cells were then transformed with the phagemid 
expression vector pCBAK3-2b. Transformed colonies con- 
taining the chloramphenicol resistant phagemids were 
selected as described above and analyzed for the correct size 

50 insert and expression of Fab as described in Example 2j. 
Following verification of the insert and expression of Fab in 
the pCBAK3-2b phagemid vector, XL 1- Blue cells were then 
transformed and induced for the expression of Fab antibod- 
ies as described in Examples 3 and 4. 

55 3. Expression of Anti-NPN Heterodimer on Phage 
Surfaces 

For expression of antibody Fab directed against NPN on 
phage surfaces. XLl-Blue cells were separately transformed 

60 with the phagemid vectors. pCBAK8-2b and pCBAK3-2b. 
prepared in Examples 2j and 2k. respectively. The transfor- 
mants were selected on LB plates containing 30 ug/ml 
chloramphenicol Antibiotic resistant colonies were selected 
for each phagemid transformation and grown in liquid 

65 cultures at 37° C. in super broth (super broth was prepared 
by admixing me following: 20 g 3 [N-MorpholinoJ propane- 
sulfonic acid (MOPS); 60 g tryptone; 40 g yeast extract; and 
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2 liter of water: adjust pH to 7.0 with 10 m NaOH) 
containing 30 ug/ml chloramphenicol and 12.5 ug/ml tetra- 
cycline for the respective antibiotic selection of the 
phagemid and the F episome. The antibiotic resistant trans- 
formed XL 1 -Blue cells were diluted to an optical density 
(OD^q am) of 0.4 in super broth. The inducer, isopropyl 
thiogalactopyranoside (IFTG). was admixed to the bacterial 
suspension for a final concentration of 1 mM and the 
admixture was maintained at 37° C. for 1 hour to induce the 
expression of the fusion protein and kappa light chain from 
the LacZ promoter. Helper phage, either R408 or VCS M13 
(Stratagene), was then admixed to the induced bacterial 
suspension at a ratio of 10-20 helper phage to 1 transformed 
bacterial cell to initiate the generation of copies of the sense 
strand of the phagemid DNA. The admixture containing the 
helper phage was then maintained for an additional two 
hours at 37° C. to allow for filamentous bacteriophage 
assembly wherein the expressed anti-NPN Fab antibodies 
fused to either bacteriophage membrane anchor domains of 
cpVUI or cpIH were incorporated into surface of the bac- 
teriophage particles. The bacterial suspension was then 
centrifuged resulting in a bacterial cell pellet and a super- 
natant containing phage. The supernatant was removed, 
collected and assayed as described below for the presence of 
functional anti-NPN Fab molecules anchored to the phage 
particles by either cpvTQ or cpIIL 

4. Assays for Verifying the Presence and Function 
of Anti-NPN Hetcrodimer on the Surface of 
Filamentous Phage 

a. Electron Microscopy 

To localize functional Fab molecules, the binding to 
antigen labelled with colloidal gold was studied. Phage 
containing supernatants and bacterial cells prepared in 
Example 3 were spotted on formvar Polysciences. Inc., 
Warrington. Pa.) coated grids affixed onto a solid phase. In 
some experiments grids were coated with cells and infected 
with phage in situ. Subsequently grids were blocked with 
bovine serum albumin (BSA) 1% in PBS at pH 7.2. washed 
and incubated with 2-7 nanometer (run) colloidal gold 
particles coated with BSA-NPN hapten conjugate for a time 
period sufficient to form a labeled immunoreaction complex. 
The grids were washed to remove excess gold particles and 
negatively stained in uranylacetate and visualized by elec- 
tron microscopy. 

Examination of filamentous phage and permeabQized 
cells producing phage revealed specific labelling of phage or 
exposed bacterial membranes. Phage were observed to con- 
tain 1 to 24 copies of antigen binding sites per particle. 
Neither helper phage alone nor intact E. coli labelled with 
antigen. Background nonspecific binding was very low. 
Filamentous phage particles emerging from the £. coli 
surfaces were labelled with antigen as shown in FIG. 9. 

The generation of a related phage surface expression 
vector utilizing cpm as a fusion partner with Clone 2b. 
pCBAK3-2b, revealed specific antigen labelling to the 
phage head but not the column. Additionally human anti- 
tetanus Fab expressed as a cpm fusion did not bind to 
BSA-NPN antigen. 

b. Phase Elisa 

Microtitration plates were coated with NPN-BSA conju- 
gate (0. 1 ml, 1 ug/ml in 0. 1M Tris-HQ pH 9.2). and blocked 
with \% BSA in PBS. Serial two fold dilutions of pCBAKfc- 
2b derived phage (0.1 ml), prepared in Example 3. were 
added to the pre-coated microtitration plate and incubated 
for 3 hours at ambient temperature or 16 hours at 4° C. The 
plates were washed with PBS and goat anti-kappa alkaline 
phosphatase conjugate (Fisher Biotech, Pittsburgh. Pa.) 
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added (0.1 ml diluted 1/1000 in PBS containing 0.1% BSA) 
and incubated for 2 hours at room temperature. The plates 
were washed in PBS and substrate added (0. 1 ml. 1 mg/ml 
p-nitrophenylphosphate in 0.1M Tris-HCl. pH 9.5, contain- 

5 ing 50 mM MgQ 2 ). After incubation at 37° C. for signal 
development, the optical densities at 400 nm were deter- 
mined. Competition assays were performed with the addi- 
tion of increasing amounts of free NPN hapten ranging from 
zero up to 5 mgAvell. 

10 The ELISA assays confirmed the presence of functional 
antibody Fab. In a two site ELISA on NPN antigen coated 
plates when probed with anti-mouse kappa chain enzyme 
conjugate, phage supernatant generated from helper phage 
infection of cells carrying the pCBAK8-2b construct exhib- 

15 ited expected titration curves with serial two fold dilutions 
of phage containing antibody. The results of the two-site 
ELISA are shown in FIG. 10. For a signal to be generated in 
this assay, the phage particle must (i) have functionally 
associated Fd and kappa chains and (ii) be multivalent 

20 Specificity of the particle was assessed by inhibiting binding 
to the plate in the presence of increasing concentrations free 
hapten. The generated phage particles exhibited binding to 
solid phase of the ELISA and could be inhibited by addition 
of hapten as shown in FIG. 11. Complete inhibition was 

25 achieved when 5 ng of free NPN hapten was used in the 
assay. Helper phage did not give a signal in the ELISA. 
c. Antigen Specific Precipitation of Phage 

Phage supernatant from XL 1 -Blue was transformed with 
the pCBAKfc-2b dicistronic expression vector prepared in 

30 Example 3 (1 ml) was incubated with BSA-NPN conjugate 
( 10 ul 2 mg/ml) for 18 hours at 4° C. The mixture was then 
pelleted by centrifugal! on at 3000 rpm on a bench top 
centrifuge and the appearance of precipitate noted. Helper 
phage was used as a control. The pellet was washed repcat- 

35 edly in cold PBS (5x3 ml/wash) and then resuspended in LB 
(0.5 ml). The solubilized precipitates were added to fresh 
XLI-Blue cells (0.5 ml of overnight culture), incubated for 
1 hour at 37° C. and aliquots plated out on LB agar 
containing cMoramphenicol (30 Mg/ml). Colonies were 

40 selected randomly. Colony lifts on nitrocellulose were 
treated with lysozyme to digest the cell wall, briefly treated 
with chloroform to breakdown the outer membrane, blocked 
in BSA 1% in PBS and incubated with l25 I labelled BSA- 
NPN antigen. After several washes in PBS (containing 

45 0.05% Tween-20). film was exposed to the washed and dried 
filter overnight at -70° C. and the autoradiographs were then 
developed. 

Precipitates were obtained with antibody containing 
phage but not helper phage in the presence of BSA-NPN. In 
50 addition, the particles retained infectiviry on subsequent 
incubation with bacterial cells carrying the F episome and 
generated 4xl0 5 colonies from a single solubilized precipi- 
tate. 

Additionally. DNA restriction analysis was carried out to 
55 determine the presence of heavy and light chain inserts. 
. DNA restriction analysis of the clones revealed the presence 
of a. Xho and Xba I fragment of 1.4 kb as expected for 
Fd-cpvm fusion construct and kappa chain insert 
These results give additional evidence for antigen speci- 
60 ficity and muitivaiency. In addition to providing immuno- 
logical parameters, this precipitation offers possibilities for 
facile enrichment of antigen specific phage particles. In 
principle, phage containing specific antibodies can be highly 
enriched by precipitation with antigens (which may be cell 
65 surface markers, viral, bacterial as well as synthetic 
molecules). The washed antigen-antibody precipitates can 
be solubilized by the addition of excess antigen and viable 
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phage recovered. For the recovery of rare species an immo- 
bilized antigen may be used which opens the way for 
differential affinity elution. 

In order to demonstrate the utility of immobilized antigen 
for the enrichment of clones of defined binding specificity, 
a panning experiment was performed. An ampicillin resis- 
tant phagemid expressing an anti-tetanus Fab as a cpVTH 
fusion was constructed. Rescue of this clone with helper 
phage produced phage encoding the ampicillin resistant 
phagemid which displayed the anti-tetanus Fab on their coat. 
These phage encoding tetanus specificity were admixed with 
NFN hapten encoding phage (1:100) and allowed to bind to 
a rmcrotitration plate coated with tetanus toxoid. Following 
a one hour maintenance period, the plate was washed 
extensively and phage were then eluted with a low pH 
buffer. Infection of XLl-Blue cells in log phase growth and 
subsequent plating of aliquots on ampicillin and chloram- 
phenicol allowed for direct quantitation of enrichment. 
Examination of over 1.000 colonies showed that ampicillin 
resistant colonies derived from the eluted phage exceeded 
chloramphenicol resistant colonies by 27 to 1. Therefore, 
panning enriched the phage displaying the anti-tetanus Fab 
by 2700 fold. This result suggests that a clone of defined 
specificity present at one part per million will dominate over 
nonspecific clones following two rounds of panning. 

5. Advantages of Assembling Combinatorial 
Antibody Fab Libraries Along Phage Surfaces 

A powerful technique for generating and selecting com- 
binatorial Fabs. with 10 s-9 members, is presented. In the 
vector described herein, the restriction cloning sites for 
inserting PCR generated antibody fragments have been 
retained as previously reported for the lambda vector. The 
rescue of the genes encoding the antibody Fd and kappa 
chains is mediated through the utilization of the f 1 origin of 
replication leading to the synthesis and packaging of the 
positive strand of the vector on co-infection with helper 
phage. Since the 'mature* virus particle assembles by incor- 
porating the major coat protein around the single stranded 
DNA as it passes through the inner membrane into the 
periplasmic space, not only does it capture the genetic 
information carried on the phagemid vector but also incor- 
porates several copies of functional Fab along the length of 
the particle. On subsequent infection of hosts cells carrying 
the F episomc the phagemid confers resistance allowing 
selection of colonies on the appropriate antibiotic In 
essence, the antigen recognition unit has been linked to 
instructions for its production. 

The full power of the earlier combinatorial system could 
not be fully utilized since screening allowed ready access to 
only about 0.1-1% of the members. In the phagemid/M13 
system similar size libraries are generated and all the mem- 
bers are accessed via affinity selection. Furthermore, unlike 
the lambda vector which generated monovalent Fabs, this 
system generates multivalent particles, thus allowing the 
capture of a wider range of affinities. 

The unique phagemid restriction sites permit the recom- 
bination of Fd and kappa chains allowing chain replacement 
or shuffling. The rescue of filamentous single stranded DNA 
allows rapid sequencing and analysis of the genetic make up 
of the clone of interest. Indeed it can be envisaged that phage 
encoding antibody specificity may be enriched by antigen 
selection prior to DNA sequencing or mutagenesis. The 
option to further develop an iterative process of mutation 
followed by selection may allow the rapid generation of high 
affinity antibodies from germ line sequences. The process 
may be automated. Setting aside the potential of the system 
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to mimic nature, the phagemid/M13 system would allow a 
more complete dissection of the antibody response in 
humans which may yield useful therapeutic and diagnostic 
reagents. 

5 The membrane anchoring of the heavy chain and the 
compartmentalization of the kappa chain in the periplasm is 
the key to expressing this functional dimeric protein. The 
potential of this system is by no means limited to antibodies 
and may be extended to any protein recognition system or 

10 combination of systems containing multiple members. For 
example coupling of ligand and effector systems in a high 
avidity matrix is now possible. In a similar vein a library of 
ligands can be sorted against a library of receptors. 

15 6. Randomized Mutagenesis of the CDR3 Region 
of a Heavy Chain Encoding Tetanus Toxoid 
a. PCR Mutagenesis with Degenerate Oligonucleotides 

To obtain a mutagenized heterodimer of this invention of 
altered specificity that would no longer recognize TT but 

20 would recognize and specifically bind to a new antigen, a 
method was developed to randomize only the CDR3 region 
of a heavy chain fragment encoded by a known nucleotide 
sequence. This approach is schematically diagrammed in 
FIG. 12 where a representative heavy chain fragment within 

25 a phagemid clone, consisting of alternating framework 
regions (1 through 4) shown by white blocks and comple- 
mentarity detenmning regions (CDR) (1 through 3) shown 
by cross-hatched blocks and the first constant region (CHI), 
is subjected to two separate rounds of PCR- In the first PCR 

30 amplification reaction, the 5' end of the heavy chain begin- 
ning at framework 1 and extending to the 3* end of frame- 
work 3 is amplified. In the second PCR amplification 
reaction, the CDR3 region is randomly mutagenized shown 
by the black box. This is accomplished through the use of a 

35 pool of oligonucleotide primers synthesized with a degen- 
erate region sandwiched between and contiguous with con- 
served framework 3 and 4 region sequences. The resulting 
amplification products, each having a randomized CDR3 
region, begin at the 3' end of framework 3 and extend to the 

40 3' end of the CHI region. The pool of degenerate oligo- 
nucleotide primers have been designed to result in the 
amplification of products having a 5* end that is comple- 
mentary to and will overlap with the 3 f end of the products 
of the first PCR reaction product Thus, the two separate 

45 PCR reaction products are pooled and subjected to a third 
PCR reaction in which the overlapping region between the 
two products is extended to result in heavy chain having a 
randomized CDR3 region. 

A heavy chain DNA template for use in this invention was 

50 available in a clone (a phagemid vector containing heavy 
and light chain fragments) from a human combinatorial 
anti-tetanus toxoid (TT) Fab library. This library was con- 
structed in the pCBAK-3 dicistronic expression vector for 
the expression of a heavy chain-cplH fusion protein (Fd- 

55 cpIH) and a soluble light chain as described for anti-NPN in 
Example 2k and by Persson et aL, Proc. Natl. Acad Sct\, 
USA, 88:2432-2436 (1992) and Barbas et aL, Proc. NatL 
Acad. ScL, USA, 88 :7978- 7982 (1992). A clone, hereinafter 
referred to as pCE-TT7E, was expressed as described for 

60 anti-NPN heterodiraers on phage surfaces in Example 3 and 
subsequently screened by panning on TT-coated pl ates as 
described for anti-NPN in Example 4c . Clone pCErTT7E 
exhibited a K rf towards TT on the order of 1(T 7 M and was 
enriched over nonspecific phage by ltf-f old as described by 

65 Barbas et aL, supra. Clone pCE-THE. having both heavy 
and light chain sequences, was used as the template DNA for 
the randomized mutagenesis of the CDR3 region of the 
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heavy chain to alter antigen binding specificity as described One hundred nanograms of gel purified products from the 
herein The sequence of the heavy chain was determined as first and second PCR reactions were then admixed with 1 ug 
described in Example la(ii). TVo separate PCR reactions each of FTX3 and CG1Z oligonucleotide primers as a 
were performed as illustrated in FIG. 12. primer pair in a final PCR reaction to form a complete heavy 

The first PCR reaction resulted in the amplification of the 5 chain fragment by overlap extension as illustrated in FIG. 
region of the heavy chain fragment in the pC3-TT7E clone 12. The PCR reaction admixture also contained 10 ui 
beginning of framework region 1 and extending to the end lOxPCR buffer. 1 ul Taq polymerase and 8 ui 2.5 raM d 
of framework region 3 which is located 5* to CDR3 which KIT* s as described above. The PCR reaction was performed 
is approximately 400 base pairs in length. To amplify this as described above. To obtain sufficient quantities of ampli- 
region, the following primer pairs were used. The 5 anii- 10 Nation product. 15 identical PCR reactions were performed 
sense oligonucleotide primer. FT3X. having the nucleotide reS ulting heavy chain fragments beginning at frame- 

sequence 5'-G-CAA-TAA-ACC-CTC-ACr-AAA-GGG-3' wor k 1 and extending to the end of CHI and having 
(SEQ ID NO 118). hybridized to the non-coding strand of randomly mutagenized CDR3 regions were approximately 
the heavy chain corresponding to the region 5* of and 790 base pairs in length. The heavy chain fragment ampli- 
induding the beginning of framework 1. The 3' sense 15 fi cat jon products from the 15 reactions were first pooled and 
oligonucleotide primer, B7EFR3, having the nucleotide tncn gc i purified as described above prior to their incorpo- 
sequence 5'-tCT-CGC-ACA-ATA-ATA-CAC-GGC-3' rat i 0 n into a phagemid library. 
(SEQ ID NO 119). hybridized to the coding strand of the b phagemid Library Construction 

heavy chain corresponding to the J end of the firamework-3 The resultant gel purified heavy chain fragments prepared 
region. The oligonucleotide primers were synthesized by ^ - in Example 6a were then digested with the restriction 
Research Genetics (Hunstville. Ala.). The PCR reaction was enzymes. Xho I and Spe L as described in Example 2d. The 
performed in a 100 ui reaction containing one ug of each of reS ultant digested heavy chain fragments were subsequently 
oligonucleotide primers FTX3 and B7EFR3. 8 ul 2.5 mM gc i purified prior to insertion into the pC3 TT7E phagemid 
dNTFs (dXTP. dCTP, dGTR aTTP). 1 ul Taq polymerase. vector clone which was previously digested with the same 
10 ng of template p€E-TT7E. and 10 ul of lOxPCR buffer 25 res triction enzymes to remove the non-mutagenized heavy 
purchased commercially (Promcga Biotech). Two drops of cnain fragment and form a linear vector. Ligation of 640 ng 
mineral oil were placed on top of the admixture and 35 0 f lnc heavy chain Xho I/Spe I fragments having 
rounds of PCR amplification in a thermocycler were per- mutagenized CDR3 regions into two ug of the linearized 
formed. The amplification cycle consisted of denaturing at pC3-TT7E phagemid vector to form circularized vectors 
94 C. for one minute, annealing at 50° C. for one minute. 30 having mutagenized CDR3 regions was performed over- 
followed by extension at 72° C. for two minutes. The ^grit # room temperature using 10 units of BRL ligase 
resultant PCR amplification products were then gel purified (Gaithersburg. Md.) in BRL ligase buffer in a reaction 
as described in Example Id and used in an overlap extension volume of 150 ui Five separate ligation reactions were 
PCR reaction with the products of the second PCR reaction. performed to increase the size of the phage library having 
both as described below, to recombinc the two products into 35 m utagenized CDR3 regions. Following the ligation 
reconstructed heavy chains containing mutagenized CDR3 reactions, the circularized DNA was precipitated at -20° C. 
regions as illustrated in FIG. 12. for two hours by the admixture of 2 ul of 20 mg/ml 

The second PCR reaction resulted in the amplification of glycogen. 15 ul of 3M sodium acetate at pH 5.2 and 300 ul 
the heavy chain from the 3* end of framework region 3 of ethanol. DNA was then pelleted by microccntrilugatioa at 
extending to the end of CHI region which is approximately ^ 4° rj. for 15 minutes. The DNA pellet was washed with cold 
390 base pairs in length. To amplify this region, the follow- 7 q<£ ct hanol and dried under vacuum. The pellet was 
ing primer pairs were used. The 5* anti-sense oligonucleotide resuspended in 10 ui of water and transformed by electropo- 
rtfimer pool, designated 7ECDR3. had the nucleotide ration into 300 ul of E. coli XLl-Blue cells as described in 
sequence represented by the formula. Example 2k to form a phage library. The total yield from the 

5'^.tat-TAT-TCT^kxj-aga-nns-NNS-NNS-NNS-NNS. 45 mutagenesis and transformation procedure described herein 

NNS-NNS-NNS-NNS-NNS-NNS-NNS-NNS-NNS-NNS-NNS- was approximately 5x10 transformants. 

tgg-OGC-caa-ggg-acc-acg-3- After transformation, to isolate phage on which het- 

where N can be A. C. G. or T and S is either C or G (SEQ erodimer expression has been induced for subsequent pan- 
ID NO 120). wherein the 5' end of the primer pool is ning on target antigens such as fi"«f^ J ml of SOC 
complementary to the X end of framework 3 represented by 50 medium (SOC was P^^f^l £ « 8 iS?5 
the Complementary nucleotide sequence of the oligonucle- tryptone. 5 g yeast extract and 0.5 g NaQ in one I erof 
Sde primer B73FR3 and the T end of the primer pool is water, adjustmg me pHto^ 

complementary to the 5' end of framework 4. The region just before use to induce the expression of the Fd-cpltt and 
SSenTe^o specified ends of the primer pool is repre- light W**^ was ^ r ^^™ 
sented by a 48-mcr NNS degeneracy which ultimately 55 shaken at 220 rpm for one ihour at 37 C. •J^* 1 } 11 ; 
encodesa'diversepopulationof mutagenized CDR3 regions of SB (SB was preyed bya^g30g ^ypto^Og 
ctl6^omdJ^slnl^T^y^^^^ y^t extract, and 10 g Mops ^^^.^P" 
otide primer. CG1Z. as described by Persson et aL. supra, adjusted to 7) containing 20 ^jate^ ™*}° 
having the nucleotide sequence ug/mi tetracycline and the admixture was .shaken at 300rpm 
v nr AT^TArTAGTTT^GTCACAAGATTTGGG-3 , 60 for an additional hour. This resultant admixture was admixed 
f^m m)^S^^^ 0f * C tolOOnnSBconta^^ 

heawSail ^corresponding to the 3' end Ac CHI. The tetracycline and shaken for one hour after which helper 
Sd S Swai formed on the pC3-TT7E in a phage VCSM13 (10*> pfu) ^^^ ^^0 
l(X>ulreactionasdescrib^ was shaken for an 70 

of oligonucleotide primers 7ECDR3 and CGIZL The result- 65 ug/ml kanamycin was admixed C. 
ant PCR an^lification product was then gel purified as overnight The lower temperature resulted ui tetter bet- 
described above erodimer incorporation on the surface of the phage. The 
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supernatant was cleared by centrifugation (4000 rpra for 15 
minutes in a MIOrotor at 4° C). Phage were precipitated by 
admixture of 4% (w/v) polyethylene glycol 8000 and 3% 
(w/v) NaCl and maintained on ice for 30 minutes, followed 
by centrifugation (9000 rpm for 20 minutes in a JA10 rotor 
at 4° C). Phage pellets were resuspended in 2 ml of PBS and 
microcentrifuged for three minutes to pellet debris, trans- 
ferred to fresh tubes and stored at -20° C. for subsequent 
screening as described below. 

For determining the titering colony forming units (cfu). 
phage (packaged phagemid) were diluted in SB and 1 ul was 
used to infect 50 ul of fresh (AOD600=1) £. coU XLI-Blue 
cells grown in SB containing 10 ug/mi tetracycline. Phage 
and cells were maintained at room temperature for 15 
minutes and then directly plated on LB/carbenicillin plates, 
c Selection of Anti-Fluorescein Heterodimers on Phage 
Surfaces 

1) Multiple Pannings of the Phage Library Having 
Mutagenized CDR3 Regions 
The phage library produced in Example 6b having heavy 
chain fragments with mutagenized CDR3 regions was 
panned as described herein on a microtiter plate coated with 
a 50 ug/ml fluorescein-BSA conjugate to screen for anti- 



Following each round of panning, the percentage yield of 
phage must be determined, where % yield— (number of 
phage eiuted/number of phage applied) xIOO. 

As an alternative to elution with acid, phage bound to the 
5 wells of the microliter plate were eluted by admixing 50 ul 
of a solution of 10" 5 M fluorescein diluted in PBS followed 
by a maintenance period of one hour at 37 C The solution 
was then pipetted up and down to wash the wells. The 
resultant eluate was transferred to 2 ml of fresh E. coli 
10 XLI-Blue cells for infection as described above for prepar- 
ing phage and further panning. In subsequent rounds of 
panning, phage were eluted with 10^M fluorescein. 

The results of the amount of phage that were specifically 
bound to fluorescein-coated wells over four consecutive 
15 rounds of panning and eluted with acid or with fluorescein 
alone are shown below in Table 8. Comparable yields of 
phage on which heterodimers were expressed mat bound 
specifically to fluorescein were achieved with either elution 
protocol Approximately 20 clones of the 5xl0 7 clones 
20 resulting from the mutagenesis and transformation exhibited 
specificity of binding towards fluorescein-coated wells. 
These data confirm that mutagenesis of the CDR3 region as 
described in this invention resulted in the altering of a 
heterodimcr which initially specifically bound to TT to one 



a 3u ug/rai uuorcitcuj-con — .~ — - — heterodimcr wnicn initially specinc 

fluorescein heterodimers. Fluorescein was conjugated to 25 ^ spccificaUy bound fluorescein. 
BSA according to the methods described in "Antibodies: A 
Laboratory Manual", eds Harlow et al.. Cold Spring Harbor 



Laboratory. 1988. 

The panning procedure described was a modification of 
that originally described by Parmley and Smith (Parmley et 30 
al.. Gene, 73:30-5-318). Two to four wells of a microtiter 
plate (Costar 3690) were coated overnight at 4° C. with 25 
ul of 50 pg/ml antigen prepared above in 0-1M bicarbonate. 
pH 8.6. The wells were washed twice with water and 
blocked by con^letelynllmgme well with 3% (wA^) bovine 35 
serum albumin (BSA) in PBS and incubating the plate at 37° 
C. for 1 hour. Blocking solution was shaken out. 50 ul of the 
phage library prepared above (typically 10 u cfu) was added 
to each well, and the plate was incubated for 2 hours at 37° 

Phage were removed and the plate was washed once with 
water. Each well was then washed 10 times with TBS/Twee n 
(50 mM Tris-HCl, pH 7.5. 150 mM Nad. 0.5% Tween 20) 
over a period of 1 hour at room temperature —pipetted up 
and down to wash the well, each time allowing the well to 
remain completely filled with TBS/Tween between wash- 
ings. The plate was washed once more with distilled water 
and adherent phage were eluted by the addition of 50 pi of 
elution buffer (0.1M HC1 adjusted to pH 22 with solid 



TABLE 8 





Phase Eluted 






Acid Ehukra 


Fluorescein EIu&od 


round 1 


5.6 x lOVwelJ 


4.7 x ltf/weD 


round 2 


4.6 x ICfWll 


5.6 x lO'/wcU 


round 3 


3.75 x lOVwell 


1.35 x lO*/wen 


round 4 


131 x 10*/wcU 


4.0 x 10*/wcD 



40 



45 



2) Preparation of Soluble Heterodimers for 
Characterizing Binding Specificity to Fluorescein 
In order to further characterize the specificity of the 
mutagenized heterodimers expressed on the surface of phage 
as described above, soluble heterodimers were prepared and 
analyzed in ELISA assays on fluorescein-coated plates, by 
competive ELISA with increasing concentrations of soluble 
fluorcscein-BSA and also by fluorescence quenching assays. 
The latter assays were performed as described in 'fluores- 
cein Hapten: An Irnmunological Probe", ed E. W. Voss. CRC 
Press, Inc. pp 52-54. 9S4. 



elution Duner iuim -ju^ «, F „ ~ — ~~ To prepare soluble heterodimers . phagemid DNA from 
Elycine containing 1 mg/ml BSA) to each well and incu- 50 positive clones was isolated and digested with Spel and 
r*' * . t n ♦„ tv— A i.,w^ n mkaT ns^ctiVm with the.nA enzvmes Droduces comDatiDle 



barion at room temperature for 10 minutes. The elution 
buffer was pipetted up and down several times, removed, 
and neutralized with 3 Ml of 2M Tris base per 50 ul of elution 
buffer used. Eluted phage were used to infect 2 ml of fresh 



NheL Digestion with these enzymes produces compatible 
cohesive ends. The 4.7-kb DNA fragment lacking the gin 
portion was gel-purified (0.6% agarose) and self-ligated. 
Transformation of E. coli XLI-Blue afforded the isolation of 



buner usea tuuxea pnagc wexe uacu iw iuiva,i * ^ *- . 

ron -n E. coU XLI-Blue cells far 15 minutes at room 55 recombinants lacking the gin fragment Clones were exam- 
VWfi00 " 1 ' J TT. . * ™ ;~-a r.mnv^l ft f the cTTT fraffment bv Xhol/Xbal 



temperature, after which 10 ml of SB containing 20 ug/ml 
carbenicillin and 10 ug/ml tetracycline was admixed. 
(Aliquot (20. 10. and Vxo ul) were removed fox plating to 
determine the number of phage (packaged phagemids) that 
were eluted from the plate.] The culture was shaken for 1 
hour at 37° G. after which it was added to 100 ml of SB 
containing 50 Mg/ml carbenicillin and 10 ug/ml tetracycline 
and shaken for 1 hour. Then helper phage VCSM13 (10 
pfu) were added and the culture was shaken for an additional 



ined for removal of the gin fragment by Xhol/Xbal 
digestion, which should yield an 1.6- kb fragment Clones 
were grown in 100 ml SB containing 50 pg/ml carbenicillin 
and 20 mM MgCl 2 at 37° C. until an OD^o of 0.2 was 
60 achieved. ETG (1 mM) was added and the culture grown 
overnight at 30° C. (growth at 37° C. provides only a light 
reduction in hetercKlimer yield). Cells were pelleted by 
centrifugation at 4000 rpm for 15 minutes in a J A 10 rotor at 
4° C. Cells were resuspended in 4 ml PBS containing 34 



ofu j were added and the culture was snaa.cn its «ui auuiuuuw -» ^. ~w*w — 1 ■ ~ 

2 hours. After this time. 70 Mg/ml kanamycin was added and 65 pg/ml phenylinethylsulf onyl fl "^^^^^ 
the culture was incubated at 37° C. overnight Phage prepa- sonication on ice (2-4 minutes at 50% duty). Debns ; was 
ration and further panning were repeated as described above. 



pelleted by centrifugation at 14.000 rpm in a JA20 rotor at 
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4° C for 15 minutes. The supernatant was used directly for Insofar as immunoglobulin genes have three CDR regions 

ELISA analysis and was stored at -20° C For the study of on both the heavy chain and the light chain of an 

a large number of clones. 10-ml cultures provided plenty of immunoglobulin, each separated by a distinctive framework 

heterodimer for analysis. In this case, sonications were region, it is to be understood that the above example is 

performed in 2 ml of buffer. 5 readily applicable to introducing mutations into a specific 

The soluble heterodiraers prepared above were assayed by CDR by selection of the above 5' and 3' nucleotide 

ELISA For this assay. 1 ug/weli of fluorescein-BSA solu- sequences as to hybridize to the framework regions flanking 

tion was admixed to individual wells of a microliter plate the targeted CDR. Thus the above first and second frarae- 

and maintained at 4° C. overnight to allow the protein work sequences can be the conserved sequences flanking 

solution to adhere to the walls of the well. After the |0 CDR1 CDR2 or CDR3 ion either th, : heavy or Ugbt chain, 

maintenance period, the wells were washed one time with Exemplary and referred is the CDR3 of the human immu- 

PBS and thereafter maintained with a solution of3%BSA to noglotaun heavy chain 

block nonspecific sites on the wells. The plates were main- /H* length of the 3 and 5 <^ ^ 

Udncd at 37° C for one hour after which time the plates were * a su ^* mutagemzing ohgonucleotide can vary in length 

lainca aijw <_iui uuc wiu , ..^ e„ lllK ,„ as is well known, so long as the length provides a stretch of 

inverted and shaken to remove the ^ -is nucleotides complementary to Ac target framework 

heterodimers prepared above were then admixed to each nce$ as to hybridize mcicto . the case of the 3'- 

well and maintained at 37° C. for one hour to form an terminal nucle oUde sequence, it must be of sufficient length 

immunoreaction products. Following the maintenance ^ complementarity to the target framework region located 

period, the wells were washed 10 times with FBS to remove 3. t0 ^ e q)R region to be mutagenized as to hybridize and 

unbound soluble antibody and then maintained with a sec- ^ proy^ a 3' hydroxy! terminus for initiating a primer exten- 

ondary goat anti-human FAB conjugated to alkaline phos- sion reaction. In the case of the 5' terminal nucleotide 

phatase diluted in PBS containing 1% BSA. The wells were sequence, it must be of sufficient length and complementa- 

maintained at 37° C. for one hour after which the wells were ^ t0 the target framework region located 5' to the CDR 

was 10 times with PBS followed by development with region to be mutagenized as to provide a means for hybrid- 

p-nitrophenyl phosphate. 25 tong in a PCR overlap extension reaction as described 

Immunoreactive heterodimers as determined in the above above to assemble the complete immunoglobulin heavy or 

ELISA were then analyzed by competition HI ISA to deter- light chain. 

mine the affinity of the mutagenized heterodimers. The Framework regions flanking a CDR are well characterized 

FT tsa was performed as described above with increasing in the immunological arts, and include known nucleotide 

concentrations of soluble fluorescein-BSA ranging in con- 30 sequences or consensus sequences as described elsewhere 

centration from 10r*M up to lO" 5 *! in concentration herem. Where a singe, pr^ 

admixed in the presence of the soluble heterodimers. Maxi- to be mutagenized the ^ WOTk - d ^^" c ^5 v S 

mal inhibition of binding was achieved at a concentration of a particular CDR arc toown ^ r ^^^ 

T*M free antigen with a half-maximal inWbition obtained ^ by nudeoude sequencing ^^^J^ 

" . Al ,rt-7 lJ( r tv,,, t u^ toire of immunoglobulin genes are to be mutagenized. the 

with >PP™u»td* 10 ^.J'*' JPjJJ?" 35 framework-derived sequences are preferably conserved, as 

mutagenized heterodimers of this invention specifically rec- 

ognize and bind to fluorescein. Additional experiments were Ascribed elsewhere herein. 

Sormed to confirm that the mutagenized hdexodimers no Preferably, the length of One 3V and 5 

longer recognized the TT to which they nonmutagenized sequences are each at least 6 nudeotides in length, and can 

hetfrodimer originally bound. Fluorescence quenching « be up to 50 or more nucleotides in length, although these 

assays were also performed to confirm the specificity of lengths are unnecessary to assure accurate and «F»duabte 

bindLg of the muTagenized heterodimers. Soluble net- hybridization. Preferred are lengths in the range of 12 to 30 

erodimers prepared from phage that were timer eluted with nucleotides, and typically are about 18 nucleoudes. 

acid or with fluorescein alone were equally effective at A particularly preferred framework-defined nucleotide 

binding fluorescein by any of the aforementioned^ sequence for use as a 3' 'f^^J™!^ 

approaches. The invention of mutagenesis of the CDR3 nucleotide sequence 5 -TGGGGCCAAGGGACCACO-3 

region of the heavy chain of a heterodimer described herein (SEQ ID NO 122). 

thus resulted in the alteration of binding specifidty from TT A particularly preferred framework-defined nucleotide 

to fluorescda sequence for use as a 5* terminus nucleotide sequence has the 

Thus, the above example illustrates a method according to 50 nucleotide sequence 5' - GTGTATTATTGTGC G A G A-3 ' 

the present invention for mutagenizing the complementarity (SEQ ID NO 123). 

dcterrnining region (CDR) of an immunoglobulin gene, and The nucleotide sequence located between the 3' and 5* 

also illustrates oligonudeotides useful therefor. termini adapted for mutagenizing a CDR can be any nude- 

In one embodiment, therefore, an oligonucleotide is con- otide sequence, insofar as the novd sequence will be incox- 

templated that is useful as a primer in a polymerase chain 55 porated by the above methods. However, the present 

reaction (PCR) for inducing mutagenesis in a complemen- approach provides a means to produce a large population of 

tarity determining region (CDR) of an immunoglobulin mutagenized CDR's in a single PCR reaction by -the use of 

gene. Tie ohgonucleotide has 3* and 5' termini and com- a population of redundant sequences defimng randomized or 

prises (1) a nudeotide sequence at its 3' terminus capable of nearly randomized nucleotides in the CDR region to be 

hybridizing to a first framework region of an immunoglo- 60 mutagenized. 

bulin gene. (2) a nucleotide sequence at its 5 f terminus A preferred oligonucleotide comprises a nucleotide 

capable of hybridizing to a second framework region of an sequence between the above described 3' and 5* termini that 

immunoglobulin gene, and (3) a nudeotide sequence is represented by the formula: |NNS] n or |NNK1„. wherein 

between the 3* and 5' termini adapted for introducing muta- N can independently be any nucleotide, where S is G or C 

tions during a PCR into the CDR region between the first 65 K is G or T. and where n is from 3 to about 24. In preferred 

and second framework regions of the immunoglobulin gene, einrxxliments the preferred oligonudeotides have the for- 

thereby mutagenizing the CDR region. raula; 
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5'<nTJIXITATTGTGCGAGA[NNS L.TGGGGCC AAGGG AC - 
CACG-3* (SEQ ID NO: 124) and 5 '-OTGTATTAi 1 U J GC- 
GAGAfNNKl.TGGGGCCAAGGGACCACG-3* (SEQ ID NO: 
123). 

Exemplary and particularly preferred is the oligonucleotide 5 
where with the formula lNNS] n and n is 16. such that the 
oligonucleotide represents a large population of redundant 
oligonucleotide sequences. 

The invention also contemplates a mutagenesis method 
for altering the immunological specificity of a cloned immu- 10 
nologlobulin gene. The method provides direct mutagenesis 
in a preselected CDR of an immunoglobulin gene which 
comprises subjecting a recombinant DNA molecule (rDNA) 
containing the cloned immunoglobulin gene having a target 
CDR to PCR conditions suitable for amplifying a prese- 15 
lected region of the CDR. In the method, the rDNA molecule 
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is subjected to PCR conditions that include a PCR primer 
oligonucleotide as described above constituting the first 
primer in a PCR primer pair as is well known to produce an 
amplified PCR product that is derived from the preselected 
CDR region but that includes the nucleotide sequences of 
the PCR primer. The second oligonucleotide in the PCR 
amplifying conditions can be any PCR primer derived from 
the immunoglobulin gene to be mutagenized. as described 
herein. 

Preferred are methods using an oligonucleotide of this 
invention as described above. 

The foregoing is intended as illustrative of the present 
invention but not limiting. Numerous variations and modi- 
fications can be effected without departing from the true 
spirit and scope of the invention. 



SEQUENCE LISTTNO 

( l ) GENERAL INFORMATION: ■ 

( i i i ) NUMBER OF SEQUENCES: 125 

< 2 ) INFORMATION FOR SEQ ID NO:l: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 173 boe pirin 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: douHe 
( D ) TOPOLOOY: Hnca 

( i k ) MOLECULE TYPE: DNA (gcacanic) 

( i i- i ) HYPOTHETICAL: NO 

{ i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESOUFTION: SEQ ID NO:l : 
GGCCOCAAAT TCTATTTCAA OOAOACAOTC ATAATOAAAT ACCTATTOCC TACOGCAOCC 60 
OCTOOATTOT TATTACTCGC TOCCCAACCA OCCATOOCCC AGGTGAAACT OCTCOAGATT 120 
TCTAOACTAO TTACCCOTAC GACOTTCCOO ACTACGOTTC TTAATAOAAT TCO 173 

( 2 ) INFORMATION FOR SEQ ID NOO; 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 173 bese pair* 
( B ) TYPE: midcac tod 
( C ) STRANDEDNESS: douhte 
( D ) TOPOLOGY: Era 

( i i ) MOLECULE TYPE: DNA (gcoamic) 

( l i i ) HYPOTHETICAL: NO 

( t v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ LD NO-2: 
TCGACGAATT CTATTAAOAA CCOTAOTCCO OAACGTCOTA COGOTAACTA OTCTAOAAAT 60 
CTCOAOCAOT TTCACCTOOO CCATOOCTOO TTOOGCAGCO AOTAATAACA ATCCAOCOOC 120 
TGCCOTAGGC AATAGOTATT TCATTATOAC TGTCTCCTTG AAATAGAATT TOC 173 

( 2 ) INFORMATJON FOR SEQ IDNOS: 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 131 baepan 
( B ) TYPE: po ckT C cad 
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{ C ) STRANDEDNESS: double 
( D ) TOPOLOGY: fine* 

( i i > MOLECULE TYPE: DNA (gcoocmc) 

( i i i > HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( t j ) SEQUENCE DESCRIPTION: SEQ ID N03: 
TOAATTCTAA ACTAOTCGCC AAGGAOACAG 
CCGCTOGATT OTTATTACTC GCTOCCCAAC 
T T A AG COG CC O 

( 2 ) INFORMATION FOR SEQ ID NO:* 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 139 base par* 
( B )TYPE: xuckac arid 
( C ) STRANDEDNESS: docble 
( D ) TOPOLOGY: fine* 

( j i ) MOLECULE TYPE: DNA (genomic) 

| i i i ) HYPOTHETICAL: NO 

( i » ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:* 
TCOACGGCCO CTTAACTCTA OAACTOACGA 
ATAACAATCC AGCGGCTGCC OTAOOCAATA 
CTAOTTTAOA ATTCAAOCT 

( 2 ) INFORMATION FOR SEQ ID NOS : 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 21 anoao tods 
( B ) TYPE: anno acid 
( D )TOPOLOOY: oaknown 

. ( i i ) MOLECULE TYPE: peptide 

< 1 i i ) HYPOTHETICAL: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO-5: 

Met Ly» Tyr Leu Leo Pro Thr Alt Ala Ala Q\f Leu Leu Leu Leu A 1 a 

i * 1 0 15 

Ala OIo Pro Ala Met 

2 0 

( 2 ) INFORMATION FOR SEQ ID NO*: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 23 aura acid* 
( B )TYPE: Motao acid 
( D ) TOPOLOGY: uakwrn 

( i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: Erwkaia carottmaa 

( x i ) SEQUENCE DESCRIPTION: 5BQ ID NOrfs 
Mot Lt. Tyr L.d Leo Pro Thr Ala Ala Ala Oty Leu Leu Leo Leu Ala 
1 5 10 13 



TCATAATOAA AT ACCTATTG CCTACGOCAO 60 
CAOCCATOGC COAOCTCGTC AOTTCTACAG 120 

1 3 1 



OCTCOGCCAT OGCTGOTTGO OCAOCOAOTA 60 
OOTATTTCAT TATOACTOTC TCCTTOGCOA 120 

1 3 9 
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Ala G I a Pro Alt Gin Pro Ala Met Ala 

2 0 2 5 



( 2 ) INFORMATION FOR SEQ rD NO:7: 

( ■ ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOOY: unknown 

( i j ) MOLECULE TYPE: peptide 

(iii ) HYPOTHETICAL: NO 

( t i ) ORIGINAL SOURCE: 

( A ) ORGANISM: Erwmia canXcmra 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N0:7: 

M«t L y • Soi Lou lie Tbr Pro lie Ala Ala O 1 y Leu Leo Leu Ala Pic 
, 5 10 15 

Set OId T y r Set Leo Ala 

2 0 



( 2 ) INFORMATION FOR SEQ ID NO: ft: 

( t ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 28 ammo acid. 
{ B ) TYPE: annuo acid 
( D ) TOPOLOGY: unknown 

( » i ) MOLECULE TYPE: peptide 

(iii ) HYPOTHETICAL: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID 140:8: 

Met Ly i lie L y « Thr O I ; Ala Arg lie Leo Ala Leu Ser Ala Leu Tbr 
1 5 JO 15 

Tbr Mel Met Pbc Ser Ala Ser Ala LeuAla Ly i lie 

2 0 2 5 



( 2 ) INFORMATION FOR SEQ ID NO*. 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 24 amino acadi 
( B ) TYPE: amino acid 
( D ) TOPOLOOY: unknown 

( i i ) MOLECULE TYPE: peptide 

(iii ) HYPOTHETICAL: NO 

( x *l ) SEQUENCE DESCRIPTION: SEQ ID NO* 

Met Met Ly* Arg A»n lie Lev Ala Val lie Val Pro Ala Lea Leo Val 

1 5 10 15, 

Ala Oly Tbr Ala Ain Ala Ala Gin 

2 0 



( 2 ) INFORMATION FOR SEQ ID NOrlO: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 23 amino math 
( B ) TYPE: anriao acid 
( D ) TOPOLOGY: unknown 

( 1 i ) MOLECULE TYPE: peptide 

(iii ) HYPOTHETICAL: NO 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NOtlO: 
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Mci Lyi Ola Set Tbr Il-e Ala Leu Ala Leo Leo Pro Leu Leu Phe*Tbr 
l 5 10 15 

ProV»lThjLy»AlaArgTbr 
2 0 

(2 ) INFORMATION FOR SEQID NOtll: 

( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 25 amino *oda 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: unknown 

{ i i ) MOLECULE TYPE; pelade 

{ i i i ) HYPOTHETICAL: NO 

( * i ) SEQUENCE DESCRIPTION: SEQID NOtll: 

Met Ser Me Oil Hi* Pbe Aig Val Ala Leo [le Pio Pbe Phe All Ala 
t 5 10 15 



Phe Cyi Leo Pro Vat Phe Ala Hii Pro 

•2 0 2 5 

( 2 ) INFORMATION FOR SEQ ID NO:12: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 amino acid* 
{ B > TYPE; «niao acid 
( D ) TOPOLOOY: unknown 

( i i ) MOLECULE TYPE: peptide 

f i i i ) HYPOTHETICAL: NO 

( x t ) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Met lie Tbr Leu Arg Lyi Leo Pro Let Ala Val Ala Val Al* Ala 
I 5 10 15 

Oly Val Met Ser Ala Ola Ala Met Aim Val Alp 

2 0 2 5 



( 2 ) INFORMATION POR SBQ ID NO: 13: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH; 22 amino ackb 
( B ) TYPE: amino acid 
( D ) TOPOLOOY: unknown 

( i i ) MOLECULE TYT^ peptide 

( i i i ) HYPOTHETICAL: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO. 13: 

Met Ly» Ala Tbr Lr» Leu Val Leo Oly Ala Val Me Leo Oly Set Tbr 

, a 10 is 

Leu Leu Ala Oly Cy* Ser 

2 0 

< 2 ) INFORMATION POR SEQ ID NO: 14: 

{ i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 23 amino eod» 
( B ) TYPE: aanno acid 
< D >TOPOLOGY: unknown 

( i i ) MOLECULE TYPE: pepbde 

(Mi ) HYPOTHETICAL: NO 

SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
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Met L y * Ly t Ser Leo Val Lea L y * Ala Ser Val Ala V»l Ala Tbr Leo 
I 5 10 13 

Val Pro Mei Leu Ser Pbe Ala 
2 0 

( 2 ) INFORMATION FOR SEQ ID NO. 15: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 18 acmno acids 
( B ) TYPE: annno acid 
( D ) TOPOLOGY: unknown 

f i i ) MOLECULE TYPE: peptide 

(iii ) HYPOTHETICAL: NO 

< x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Lyi Ly i Leo Leo Phe Ala lie Pro Lea Val Val Pro Pbe Tyi Ser 

H i i Ser 

( 2 ) INFORMATION FOR SEQ ID NO:16: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 211 annuo acid* 

< B ) TYPE: annuo acid 

{ D ) TOPOLOGY: unknown 

( i i ) MOLECULE TYPE: protein 

(iii ) HYPOTHETICAL: NO 

{ t ) FRAOMENT TYPE: internal 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:l6: 

•Pro Phe Val Cys Oln Tyr Oln Gly Gin Gly Ola Ser Ser Aip Leo Pro 
1 5 10 15 

Oln Pro Pro Val Ain Ala Gly Gly Gly Ser OJy Gly Gly Ser Oly Gly 
2 0 2J 3 0 

Gly Ser ON Gly Oly Oly Ser OU Gly Oly Gly Ser Oln Oly Oly Gly 
35 40 45 

Ser Olo Oly Oly Oly Ser Oly Gly Oly Ser Oly Ser Oly Aip Pbe Aip 
5 0 5 5 6 0 

Tyr Olo Lyi Met Ala Acs Ala Asa Ly i Oly Ala Met Tbr Olo Asa Ala 

65 70 7 5 SO 

A*p Olo Asa Ala Loo Ola Ser Aip Ala Lyi Oly Lyi Leo Aip Ser Val 

8 5 9 0 9 5 

Ala Tbr Aip Tyr Oly Ala Ala lie Alp Oly Pbe lie Oly Aip Val Ser 
10 0 10 5 110 

Oly Leo Ala Am Oly An Gly Ala Tbr Gly Aip Pbe Ala Oly Ser Ain 



1 1 5 



1 2 5 



Ser Gin Met Ala Oln Val Oly Aip Gly Aip Ain Ser Pro Leo Mel Asa 

13 0 13 3 I 4 0 

AtoPbe Arg Gin Tyr Leo Pro Ser Leo Pro Ola Ser Val Olo Cy i Are 

J45 150. 133 1*0 



P r o 



Pbe Val Phe Ser Ala Oly Lyi Pro Tyr Olo Pbe Ser lie Aip Cyi 
1 6 5 1 7 0 1 7 3 



A»p Lyi lie Ain Leo Pbe Arg Oly Val Pbe Ala Pbe Leo Leo Tyr Val 

ISO 185 1°0 

Ala Tbr Phe Met Tyf Val Pbo Ser Tbr Phe Ala Ain lie Leo A r a Aia 

195 200 205 
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Lys Gin Sex 
2 1 0 



f 2 i INFORMATION FOR SEQ ID NO: 17: 



( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 50 amino acids 
( B ) TYPE: amu» ackl 
( D ) TOPOLOOY: < 



{ i i ) MOLECULE TYPE: protein 
( i i i ) HYPOTHETICAL: NO 
< x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



A I * O 1 u O 1 y A<p Asp Pro All Lys A I* All Pbe Aid Ser Leo Ola Al 



I 5 



0 >5 



Ser Ala Thr Oln T y r lie Oly Tyr Al* T r p All Mot V.l V.l V.l lie 

20 25 30 

V.l Oly Ala Thr Mi Gly lie Lys Leo Pbe Ly* Lys Pbe Thr Ser Lys 
35 ' *° 45 

All Ser 

5 0 

( 2 ) INFORMATION FOR SBQ ID NO: 18: 

{ i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 bwe pain 
( B )TYPE: Bockk acid 
( C ) STRANDEDNESS: double 
{ D ) TOPOLOOY: line* 

( i i ) MOLECULE TYPE: RNA (genome) 

(iii ) HYPOTHETICAL: NO 

( i * ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NCnlS: 
AAUCUUOOAO OCUUUUUUAU OOUUCOUUCU 

( 2 ) INFORMATION FOR SBQ ID NO: 19: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH; 30 bue pairs 
( B ) TYPE: smckac acid 
( C ) STRANDEDNESS: docfcle 
( D ) TOPOLOGY: Hacsr 

( i i > MOLECULE TYPE: RNA (geoamc) 

(iii ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
UAACUAAOOA UOAAAUGCAU GUCUAAOACA 

( 2 ) INFORMATION FOR SEQ ID NO-20: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 30 base pain ■ ' 
•(B) TYPE: oneiric acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOOY: Eacar 

( i j ) MOLECULE TYPE: RWA<q*s*odc) 

(iii ) HYPOTHETICAL; NO 
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( i y ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
UCCUAOGACO UUUGACCUAU OCOAOCUUUU 30 



| 2 ) INFORMATION FOR SEQ ID NOJI : 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH; 30 baje padre 
( B ) TYPE: oucteic acid 
( C ) STRANDS DNESS: dtwfcle 
f D ) TOPOLOOY: Xtx* 

( i i ) MOLECULE TYPE: RNA (genome) 

( i i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
AUOUACUAAG GAOGUUOUAU GOAACAACOC 



( 2 ) INFORMATION FOR SEQ ID NO:22: 

{ t ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 tee plan 
( B ) TYPE: ooclbc acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: fine* 

( i i ) MOLECULE TYPE: DNA {genomic) 

( i i i ) HYPOTHETICAL: NO 

< i v ) ANTI-SENSE: NO 

(xi ) SEQUENCE DESCRIPTION: SEQ CD N022: 
OGCCGCAAAT TCTATTTCAA OGAOACAOTC AT 



( 2 ) INFORMATION FOR SEQ ID NOlO: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 36 h**c pain 
( B ) TYPE: aodeic add 
( C ) STRANDEDNESS: nagfe 
< D ) TOPOLOOY: Enear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N023: 
AATGAAATAC C TAT TGCCT A COOCAOCCOC TGOATT 



( 2 ) INFORMATION FOR SBQ ID N024: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 71 base pain 
( B ) TYPE: mckic seal 
( C ) STRANDEDNESS: smgje 
( D ) TOPOLOGY: fiacar 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 



4 x i ) SBQUENCE DESCRIPTION: SEQ CO NO£4: 
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OTT ATTACTC cctgcccaac cagccatogc cc 



( 2 ) INFORMATION FOR SEQ ID NCh25: 

( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 29 base pairs 
I B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: itngk 
( D ) TOPOLOGY: Gacar 

( i i ) MOLECULE TYPE: DNA (jcnooac) 

( i i i ) HYPOTHETICAL: NO 

( i y ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N025: 
CAOTTTCACC TOOOCCATGO CTOOTTOOO 

{ 2 > INFORMATION FOR SEQ ID NO-.26: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 40 base pain 
( B ) TYPE: snckic acid 
( C ) STRANDEDNESS: ringle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i I ) HYPOTHETICAL: NO 

( i » ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NC«»: 
CAOCOAOTAA TAACAATCCA OCOOCTOCCO TAOOCAATAO 

( 2 ) INFORMATION FOR SEQ ID NO:27: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 3» base pain 
( B ) TYPE: wdek acid 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO ■> 

< i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-J7: 
OTATTTCATT ATOACTOTCT CCTTOAAATA OAATTTGC 

( 2 ) INFORMATION FOR SEQ ID N02S: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 40 base pass 
( B )TYPE: oucJcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: fiacar 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iit ) HYPOTHETICAL: NO 

( i * ) ANTI-SENSE: NO . 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N028: 
AGOTOAAACT OCTCOAOATT TCTAOACTAO TTACCCOTAC 



( 2 ) INFORMATION FOR SEQ ID NOt29: 
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( i > SEQUENCE CHARACTERISTICS: 
f A ) LENGTH: 38 ha*> pain 
( 8 )TYFE: nucleic nod 
( C ) STRANDS DNESS: single 
( D ) TOPOLOGY: linear 

< i i > MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( t * ) ANTI-SENSE: NO 

SEQUENCE DESCRIPTION: SEQ ID NCK29: 

CGOAACGTCG TACOOOTAAC TAOTCTAOAA ATCTCOAO 



( 2 ) INFORMATION FOR SEQ ID NO-JO: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 bast pain 
( B )TYPE: ddcJoc acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Hacar 

< i i ) MOLECULE TYPE: DNA (geaefnic) 
( i i i > HYPOTHETICAL: NO 

< i r > ANTI-SENSE: NO 

< i i ) SEQUENCE DESCRIPTION: SBQ ID NO-JO: 
OACGTTCCGO ACTACCOTTC TTAATAGAAT TCO 



( 2 ) INFORMATION FOR SEQ ID NO-Jl: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 28 base pairs 
( B ) TYPE: aaciac acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOOY:Eae» 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i t i ) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOSI: 
TCGACOAATT CTATTAAGAA CCOTAOTC 



( 2 ) INFORMATION FOR SEQ ID NOJ2: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 34 base pain 
.(B) TYPE: nucleic acid 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

(i i ) SEQUENCE DESCRIPTION: SEQ ID NO-J2: 
TGAATTCTAA ACTAOTCOCC AAGOAOACAO T C AT 



( 2 ) INFORMATION FOR SEQ ID NOJ3: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 36 base pain 
( B > TYPE: oockie acid 
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( C ) STRANDEDNESS: tingle 
{ D ) TOPOLOGY: lineal 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iii ) HYPOTHETICAL: NO 

( i * ) ANTI-SENSE: NO 

( i i > SEQUENCE DESCRIPTION: SEQ ID NO".33: 
AATOAAATAC CTATTOCCTA CGOCAOCCOC TCOATT 



( 2 ) INFORMATION FOR SEQ ID NO-3*: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 31 base pabi 
< B ) TYPE: Bocleic add 
( C ) STRANDEDNESS: ungfe 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iii ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-34: 
GTT ATTACTC GCTOCCCAAC caoccatogc c 



{ 2 ) INFORMATION FOR SEQ ID NO J3: 

( t ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pair* 
( B ) TYPE: noclek acid 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: Udcm 

( i j > MOLECULE TYPE: DNA (genomic) 

(iii ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-35: 
OAGCTCOTCA OTTCTAOAOT TAAOCOOCCO 



( 2 ) INFORMATION FOR SEQ ID N006: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENOTH: 48 bate pan 
( B ) TYPE: Bocleic acid 
< C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Bocae 

( i j ) MOLECULE TYPE: DNA (genomic) 

( iii) HYPOTHETICAL: NO 

( i » ) ANTI-SENSE: NO 

( g i ) SEQUENCE DESCRIPTION: SEQ ID NOt36: 
OT ATT T CATT ATOACTGTCT CCTTOGCGAC TAOTTTAOAA TTCAAOCT 



( 2 ) INFORMATION FOR SEQ ID NOlH: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENOTH: 40 bwo pan 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: tingle 
( D )TOPOLOOY: fine* 



( i t ) MOLECULE TYPE: DNA (genomic) 
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( i i i ) HYPOTHETICAL: NO 
( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-J7: 
CAGCOAGTAA TAACAATCCA OCGOCTOCCO TAOGCAATA 



2 ) INFORMATION FOR SEQ ID N038: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 27 base pain 
( B ) TYPE: nucldc acid 
( C > STRANDEDNESS: imgle 
( D )TOPOLOOY: tin** 

( i j ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SBQ ID N038: 
TGACOAOCTC GOCCATOGCT GOTTGOO 



( 2 ) INFORMATION FOR SEQ ID NOJ9: 

( i ) SEQUENCE CHARACTERISTICS: 

< A ) LENGTH: 2* base pain 

< B ) TYPE: DDckic acid 

( C ) STRANDEDNESS: *mg}c 
( D ) TOPOLOGY: linear 

< i i ) MOLECULE TYPE: DNA (genomic) 
( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

< x i ) SEQUENCE DESCRIPTION: SBQ ID NC09: 
TCOACGOCCG CTTAACTCTA OAAC 



( 2 ) INFORMATION FOR SEQ ID NOtO: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 bwepaos 
( B )TYPE: nucleic «d 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Hour 

( i i ) MOLECULE TYPE: DNA (jownic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N04Ch 
AOOTSMARCT JCTCOAOTCW 00 



( 1 ) INFORMATION FOR SEQ ID NO:41: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 bate fmn 
< B ) TYPE: nockie jbd 
( C ) STRANDEDNESS: sbgk 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iii ) HYPOTHETICAL: NO 



( i ▼ ) ANTI-SENSE: NO 
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f x i ) SEQUENCE DESCRIPTION: SEC* 0> NO*l: 
AOGTCCACCT GCTCGAGTCT GO 



( 2 ) INFORMATION FOR SEQ ID NO:42; 

I i ) SEQUENCE CHARACTERISTICS; 

( A ) LENGTH: 22 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS; (ingle 
( D ) TOPOLOGY: finest 

{ i i ) MOLECULE TYPE: DNA (genomic) 

< j i i ) HYPOTHETICAL: NO 

| iv ) ANTI-SENSE: NO 

( x i > SEQUENCE DESCRIPTION: SEQ ID NOvC: 
AOOTCCAOCT CCTCGAOTCA GO 

( 2 ) INFORMATION FOR SEQ ID NOt43: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 22 bate pain 
( B ) TYPE: nucleic acid 
( C > STRANDEDNESS: jingle 
( D ) TOPOLOGY: Ibear 

( i i ) MOLECULE TYPE; DNA (genomic) 

(iti ) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(ii ) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
AOOTCCAOCT TCTCGAOTCT GO 



( 2 ) INFORMATION FOR SEQ ID NO*4: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 22 base peon 
( B ) TYPE: index acid 
{ C ) STRANDEDNESS: ttngfe 
f D ) TOPOLOGY: linear 

( i i ) MOLECULE TYP& DNA (genomic) 

( i j i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
AOOTCCAOCT TCTCOAGTCA OO 

( 2 )INPORMAllONPORSEQIDNO:43: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 boo pain 
{ B ) TYPE: flockic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i j i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( ft > ) SEQUENCE DESCRIPTION: SEQ ID NO:45: 



AOOTCCAACT OCTCOAOTCT Q O 
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( 2 ) INFORMATION FOR SEQ ID NO:46: 

( j ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base ptan 
( B )TYPE: mckac acid 
( C ) STRANDEDNESS: ringk 
( D ) TOPOLOGY: fioca 

( i i ) MOLECULE TYPE: DNA (jjcoannc) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE; NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:<6: 
AGGTCCAACT OCTCOAOTCA GO 



( 2 ) INFORMATION FOR SEQ ID NO:47: 

( j ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 bue pan 
( B ) TYPE: ODckk acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: lincat 

( j i ) MOLECULE TYPE: DNA Qjeoonoc) 

( i i i ) HYPOTHETICAL: NO 

( i •» ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SBQ ID NO:47: 
AOOTCCAACT TCTCOAOTCT GO 



( 2 ) INFORMATION FOR SBQ ID NO: 48: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: raigJe 
( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ED NO:4S: 
AGGTCCAACT TCTCGAOTCA GG 



< 2 ) INFORMATION FOR SBQ ID NO:*9: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pan 
C B ) TYPE: nucleic *ad 
( C ) STRANDEDNESS: ringte 
( D ) TOPOLOGY: fine* 

< i i ) MOLECULE TYPE: DNA (genwmc) 

( » i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:49: 
AOOTNNANCT NCTCOAOTCW OO 



{ 2 ) INFORMATION FOR SEQ ID NO.50: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base prin 
{ B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: fioew 

< i i ) MOLECULE TYPE: DNA (gtxaaac) 
{ i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

< x i ) SEQUENCE DESCRIPTION: SEQ ID NOJO: 
OCCCAACGAT GTGCTCACC 



( 2 ) INFORMATION FOR SEQ ID NO-5I: 

( i ) SEQUENCE CHARACTERISTICS: 

< A ) LENGTH: 39 Use pan 
( B ) TYPE: nucleic acid 

< C ) STRANDEDNESS: imsle 
( D ) TOPOLOGY; fine- 

( i i ) MOLECULE TYPE: DNA (genooric) 

< i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SBQ ID NCh5l : 
CTATTAGAAT TCAACOOTAA CAOTCGTOCC TTGGCCCCA 



( 2 ) INFORMATION FOR SEQ ID NCc32: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 38 bwt pmn 
( B >TYPE: nuckic arid 
( C ) STRANDEDNESS: ringk 
( D ) TOPOLOGY: tar 

( i i ) MOLECULE TYPE: DNA (grmomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i > SEQUENCE DESCRIPTION: SBQ ID NO.52: 
CTATTAACTA OTAACOGTAA CA.OTOOTOCC TTGCCCCA 



{ 2 ) INFORMATION FOR SEQ ID N053: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 hwc pan 
( B ) TYPE: obekk acid 
{ C ) STRANDEDNESS: tingfe 
{ D ) TOPOLOGY: hmM 

( i i > MOLECULE TYPE: DNA (gcoocmc) 

( i i i ) HYPOTHETICAL: NO 

( t t ) ANTI-SENSB: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-53: 
CTCAOTATGO TOOTTOTOC 



( 2 ) INFORMATION FOR SEQ ID NO-M: 

( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 26 b*x ptnn 
( B >TYPE: axlacMad 
{ C ) STRANDEDNESS: ua&e 
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( D > TOPOLOOY: linear 
( i i ) MOLECULE TYPE: DNA (genomic) 
( i t i ) HYPOTHETICAL: NO 
( i ▼ ) ANTI-SENSE: NO 

f x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.34: 
OCTACTAGTT TTOATTTCCA CCTTGO 



( 2 ) INFORMATION FOR SEQ ID NOS5: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 23 hw pairs 
{ B ) TYPE: oucldc add 
( C ) STRANDEDNESS: lingJe 
( D ) TOPOLOOY: Encar 

( i i ) MOLECULE TYPE: DNA (geaaxnc) 

(tit ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

| i i ) SEQUENCE DESCRIPTION: SEQ ID NO-55: 
CAGCCATOGC COACATCCAO A TO 



< 2 ) INFORMATION FOR SEQ ID NOJ6: 

( i } SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pair* 
( B )TYPE: nucleic add 
( C > STRANDEDNESS: «n$k 
( D ) TOPOLOGY: focar 

( *i I ) MOLECULE TYPE: DNA (genoanc) 

(itt ) HYPOTHETICAL: NO 

( t y ) ANTI-SENSE: NO 

(li ) SEQUENCE DESCRIPTION: SEQ ID NO--56: 
AATTTTACTA OTCACCTTOO TOCTOCTOOC 



( 2 ) INFORMATION FOR SEQ ID NO-J7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 39 baoc fain 
( B ) TYPE : nucleic acid 
( C ) STRANDEDNESS: nagfe 
( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iit ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO*J7: 
TATOCAACTA OTACAACCAC AATCCCTOOO CACAATTTT 



( 2 ) INFORMATION FOR SEQ ID NOt58: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pain 
( B ) TYPE: ouckk add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 



( i i ) MOLECULE TYPE: DNA (g««nk) 
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f i t i ) HYPOTHETICAL: NO 
( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NCh58: 
AGCCTT ACTA OTACAATCCC TOGOCACAAT 



{ 2 ) INFORMATION FOR SEQ ID NO.59: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pain 
( B ) TYPE: nucleic acid 
< C ) STRANDEDNESS: raglc 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( t i ) SEQUENCE DESCRIPTION: SEQ ID NOt59: 
CCAOTTCCOA GCTCOTTOTO ACTCAGGAAT CT 



( 2 ) INFORMATION FOR SEQ ID NO.-60: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pm 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: jngle 
( D )TOPOLOOY: linear 

(it) MOLECULE TYPE: DNA (genomic) 

(tit ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOrfO: 
CCAOTTCCOA OCTCOTOTTO ACGCAGCCOC CC 



( 2 ) INFORMATION FOR SBQ ID NOrfl: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 boe jmn 
( B ) TYPE: oockk add 
( C ) STRANDEDNESS: natfe 
( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

(ill ) HYPOTHETICAL: NO 

( i t ) ANTVSENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOAl: 
CCAOTTCCOA OCTCOTOCTC ACCCAGTCTC CA 



( 2 ) INFORMATION FOR SEQ ID N032: 

' ( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base ptti 
( B ) TYPE: oockac add 
( C ) STRANDEDNESS: lingfe 
( D ) TOPOLOOY: finear 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iii ) HYPOTHETICAL: NO 



(i») ANTI-SENSE: NO 
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( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:62: 
CCAOTTCCOA OCTCCAOATG ACCCAGTCTC CA 



( 2 ) INFORMATION FOR SEQ ID NCfc63: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pain 
f B ) TYPE: nucleic add 
< C ) STRANDEDNESS: ungk 
( D ) TOPOLOGY: fine* 

{ i i ) MOLECULE TYPE: DNA (geoemk) 

( i j i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO^S: 
CCAOATOTOA OCTCOTOATO ACCCAOACTC CA 



( 2 ) INFORMATION FOR SEQ ID NO** 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pain 
( B ) TYPE: oocleic mad 
( c ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: finear 

( t i ) MOLECULE TYPE: DNA (genomic > 

< i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO**: 
CCAOATOTOA GCTCGTCATG ACCCAGTCTC CA 



( 2 ) INFORMATION FOR SEQ ID NOi65: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pain 
( B ) TYPE: ODclac acid 
( C ) STRANDEDNESS: nagk 
( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (jtoccnic) 

( i j i ) HYPOTHETICAL: NO 
(it) ANTI-SENSE: NO 

( x j ) SEQUENCE DESCRIPTION: SEQ ID NO*5: 
CCAOATOTOA OCTCTTOATO ACCCAAACTC AA 



( 2 ) INFORMATION FOR SEQ ID NO^S6: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: noflfe 
( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NCn66: 
CCAOATOTOA OCTCOTOATA ACCCAOOATO AA 
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( 2 ) INFORMATION FOR SEQ ID NO*7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 ha»e pmn 
( B ) TYPE: aaclac add 
( C ) STRANDEDNESS: imgfe 
( D > TOPOLOGY: linear 

( i t ) MOLECULE TYP6: DNA (yeooonc) 

( i i i > HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO .-67: 
OCAOCATTCT AOAOTTTCAO CTCCAOCTTO CC 



( 2 ) INFORMATION FOR SEQ ID NO*8: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 base pain 
( B > TYPE; noclcic acid 
( C ) STRANDEDNESS: nogfe 
f D ) TOPOLOGY: Enear 

(it) MOLECULE TYPE: DNA (graoeme) 

( i i i ) HYPOTHETICAL: NO 

< i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO*»: 
CCGCCOTCTA OAACACTCAT TGCTGTTOAA OCT 



( 2 ) INFORMATION FOR SEQ ID NC*»: 

{ i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 33 baac pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: ringfe 
( D ) TOPOLOOY: linear 

< i i > MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO** 
CCOCCOTCTA OAACATTCTO CAGOAGACAO ACT 



( 2 ) INFORMATION FOR SEQ ID NO: 70: 

( i ) SEQUENCE CHARACTERISTICS: 

< A ) LENGTH: 32 base paks 
( B ) TYPE: nucleic acid 

< C ) STRANDEDNESS: angle 
( D ) TOPOLOOY: Eoear 

( i i ) MOLECULE TYPE: DNA genomic) 

< i i i ) HYPOTHETICAL: NO 

( i ▼ ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:70: 
CCAOTTCCOA GCTCOTOATO ACACAOTCTC CA 



( 2 ) INFORMATION FOR SEQ ID NO:71: 

( j ) SEQUENCE CHARACTERISTICS: 
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( a ) LENGTH: 3* banc peon 
( B ) TYPE: Buckie acid 
( C ) STRANDEDNESS: i ingle 
< D ) TOPOLOOY: linear 

{ i j > MOLECULE TYPE: DNA (genomic) 

(tii ) HYPOTHETICAL: NO 

{ i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:71: 
OCOCCOTCTA OAATTAACAC TCATTCCTOT TGAA 



( 2 ) INFORMATION FOR SEQ ID NO:72: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 3» b«e pass 
( B ) TYPE: oockic aod 
( C ) STRANDEDNESS: imgfe 
( D ) TOPOLOGY: fine* 

( i i ) MOLECULE TYPE: DNA (genoax) 

( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SBQ ID NO:72: 
CTATTAACTA OTAACOCTAA CAOTOGTGCC TTGCCCCA 



( 2 ) INFORMATION FOR SBQ ID NO: 73: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 30 base pain 
( B ) TYPE: juctdc mad 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (gcaxcoc) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( * i ) SEQUENCE DESCRIPTION: SBQ ID NO:73: . 
AGOCTTACTA OTACAATCCC TOOOCACAAT 



( 2 ) INFORMATION FOR SEQ ID NO:74: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 29 base pair* 
( B ) TYPE: ODckic acid 
( C ) STRANDEDNESS: angle 
( D )TOP03jOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

{ i i i > HYPOTHETICAL: NO 

< i » > ANTI-SENSE: NO 

< x i ) SEQUENCE DESCRIPTION: SBQ ID NO:74: 
OCCOCTCTAG AACACTCATT CCTOTTGAA 



( 2 ) INFORMATION FOR SEQ ID NCX75: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pasi 
( B )TYPE: flockac add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOOY: fine* 
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( i i ) MOLECULE TYPE DNA (genomic) 
( i i i ) HYPOTHETICAL: NO 
( i y ) ANTI-SENSE: NO 

(x») SEQUENCE DESCRIPTION: SEQ H> N073: 
AOOTNNANCT NCTCGAOTCT OC 



( 2 ) INFORMATION FOR SEQ U> NO: 76: 

( j ) SEQUENCE CHAXACTERJ5T1CS: 
( A ) LENGTH: 22 hue pan 
{ B ) TYPE: auckic acid 
{ C ) STRANDEDNESS: »ingk 
( O ) TOPOLOGY: Bucm 

{ i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:76 : 
AOOTNNANCT NCTCOAOTCA GC 



( 2 ) INFORMATION FOR SBQ ID NOV77: 

( i ) SEQUENCE CliAJUCTERJSTlCS: 
( A ) LENGTH: 35 bme pan 
( B ) TYPE: cuckx tad 
( C ) STRANDEDNESS: ungk 
( D ) TOPOLOGY: Snear 

< i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( iv) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SBQ ID NO:77: 
OTOCCAOATO TOAOCTCOTO ATOACCCAGT CTCCA 



( 2 ) INFORMATION FOR SBQ ID NO:7S: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 34 U*o pro 
( B ) TYPE: oockic Kid 
( C ) STRANDEDNESS: mgle 
( D ) TOPOLOOT: fine* 

( i L ) MOLECULE TYPE: DNA (genomic) 



( i i i ) HYPOTHETICAL: NO 
( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N07S: 
TCCTTCTAOA TTACTAACAC TCTCCCCTOT TOAA 



3 4 



( 2 )1>^PORMATION FORSBQlDNO:79-. 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 34 b«M poi 
( B ) TYPE: nocloc acid 
( C ) STRANDEDNESS: sm^e 
( D ) TOPOLOGY: 

( i i ) MOLECULE TYPE: DNA (genomic) 

(ill ) HYPOTHETICAL: NO 
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( i * ) ANTI-SENSE: NO 

SEQUENCE DESCRIPTION: SEQ ID NO-.79: 
OCATTCTAOA CTATTATOAA CATTCTOTAO OOOC 



( 2 > INFORMATION FOR SEQ ID WMO: 

{ i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 37 hue pair* 
{ B ) TYPE: oocleic acid 
( C ) STRANDEDNESS: «ngk 
{ D ) TOPOLOGY: liDcai 

( » i ) MOLECULE TYPE: DNA (gcwxmc) 

( i i i ) HYPOTHETICAL: NO 

( i » ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:80: 
CTGCACAOOO TCCTOGOCCO AGCTCOTOOT GACTCAO 



( 2 ) INFORMATION FOR SEQ ID NO->l: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 b«*e ptan 
( B ) TYPE: mdeic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: tinear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

(it ) ANTI-SENSE: NO 

(x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.81: 
AGNTOCANNT OCTCOAOTCT GO 



( 2 ) INFORMATION FOR SEQ ID NO.52: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 43 base pain 
( B ) TYPE: wckac acid 
( C ) STRANDEDNESS: wngle 
( D ) TOPOLOGY: fine* 

< i i ) MOLECULE TYPE: DNA (gatxmc) 

( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

(i i ) SEQUENCE DESCRIPTION: SEQ ID NO:82: 
OTOOOCATOT GTOAGTTGTO TCACTAOTTG GOGTTTTOAO 



( 2 ) INFORMATION FOR SEQ E> NO:83: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: smgfe 
( D ) TOPOLOGY: finer 

( i i ) MOLECULE TYPE: DNA {genomic) 

{ j i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.S3: 
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AOCATCACTA otacaaoatt toqgctc 



( 2 ) INFORMATION FOR SEQ ID NO-.84: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pair* 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: jingle 
{ D ) TOPOLOGY: fincar 

- ( i i ) MOLECULE TYPE; DNA (gcnocsc) 

( i i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.84: 
AOOTGC AGCT OCTCOAGTCT OO 



( 2 ) INFORMATION FOR SEQ ID NO-.85: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: tingle 
< D ) TOPOLOOY: line* 

( i » ) MOLECULE TYPE: DNA (gcocanc) 

< t i i ) HYPOTHETICAL: NO 

( i ▼ ) ANTI-SENSE: NO 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NCnttS: 
AOOTGCAGCT OCTCOAGTCO OO 



( 2 ) INFORMATION FOR SEQ ID NOS6: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 b*»c pans 
< B ) TYPE: nockac add 
( C ) STRANDEDNESS: singae 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

< i i i > HYPOTHETICAL: NO 

( i ▼ ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:96: 
AO0T0CAACT GCTCGAOTCT GO 



( 2 ) INFORMATION FOR SEQ ID NO:87: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pair* 
( B )TYPE: oudcic aod 
( C ) STRANDEDNESS: sanfe 
( D ) TOPOLOGY: fine* 

( i i ) MOLECULE TYPE: DNA (geaaac) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO-J7: 



AOOTOCAACT OCTCOAGTCO OO 
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( 2 ) INFORMATION FOR SEQ ID NO 88: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 34 base pain 
{ B ) TYPE: oockic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

< i t ) ANT] -SENSE: NO 

< x i ) SEQUENCE DESCRIPTION: SEQ ID NCfcM: 
TCCTTCTAOA TTACTAACAC TCTCCCCTGT TGAA 



( 2 ) INFORMATION FOR SEQ ID NO-.3* 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 37 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: «ngk 
( D ) TOPOLOGY: fine* 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i ▼ ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NChW: 
CTGCACAOOO TCCTOOOCCO AOCTCOTOOT OACTCAG 



( 2 ) INFORMATION FOR SEQ ID NO90: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 31 base pair» 
( B ) TYPE: nucleic ackl 
( C ) STRANDEDNESS: imfr 
( D ) TOPOLOGY: Enear 

( i i ) MOLECULE TYPE: DNA (gaocMc) 

(tit ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

< x i ) SEQUENCE DESCRIPTION: SEQ ID NCh90r 
OCATTCTAOA CTATTAACAT TCTOTAOGOO C 



( 2 ) INFORMATION FOR SEQ ID N<fc9J : 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base pain 
< B ) TYPE: DDcldc add 
( C ) STRANDEDNESS: ttngte 
{ D ) TOPOUOO Y: fine* 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iii ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N03J : 
ACCCAAOOAC ACCCTCATO 



( 1 ) INFORMATION FOR SEQ ID NOSl: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) 1-ENOTH: 19 bato pan 
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( B ) TYPE: nucleic acid 

< C ) STRANDEDNESS: nngie 

( D ) TOPOLOGY: linear 

{ i i ) MOLECULE TYPE: DNA ^genomic) 

<ii> > HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

(, i ) SEQUENCE DESCRIPTION: SEQ ID NO:92: 
CTCAOTATOG TOOTTGTOC 



( 2 > INFORMATION FOR SEQ ID NC*93: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 bwo pain 
( B )TYPE: nucleic acid 
( C ) STRANDEONESS: ragie 
( D ) TOPOLOOY: line* 

{ i i ) MOLECULE TYPE: DNA (genomic) 

{ i i i > HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:93: 
OTCTCACTAG TCTCCACCAA GOOCCCATCO OTC 



( 2 ) INFORMATION FOR SEQ ID ^9*: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 39 bete pain 
( B ) TYPE: npdeic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOOY: Bnear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i I ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( z i ) SEQUENCE DESCRIPTION: SEQ ID NCfc94: 
ATATACTAOT OAOACAOTOA CCAGGGTTCC TTOOCCCC 



{ 2 ) INFORMATION FOR SEQ ID N095: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 24 brae pair* 
< B ) TYPE: nocJeic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( I v ) ANTI-SENSE: NO 

( x i } SEQUENCE DESCRIPTION: SBQ ID NOS5: 
ACOTCTAOAT TCCACCTTOO TCCC 



( 2 ) INFORMATION FOR SEQ ID NOS6: 

< i ) SBQUENCE CHARACTERISTICS: 
( A ) LENGTH: 31 b«e jmn 
( B ) TYPE: endoc add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOOY: Baear 
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( i i ) MOLECULE TYPE: DNA (genomic) 
( i i i > HYPOTHETICAL: NO 
( i v ) ANTI-SENSE: NO 

SEQUENCE DESCRIPTION: SEQ ID NOS6: 
OCATACTAOT CTATTAACAT TCTOTAOGGG C 

( 2 ) INFORMATION FOR SEQ ID NOS7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 base paws 
( B ) TYPE: nucleic wad 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Enor 

( i i ) MOLECULE TYPE: DNA (genomic) 

(ill ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ H)NOt97: 
CCOOAATTCT TATCATTTAC CCOOAOA 

( 2 ) INFORMATION FOR SEQ ID NOSS: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 29 base pab» 
( B ) TYPE: aoclac tad 
( C > STRANDEDNESS: stogie 
( D )TOFOLOOY: VmcM 

< i i ) MOLECULE TYPE: DNA (genomic) 
( i i i ) HYPOTHETICAL; NO 

( i ▼ ) ANTI-SENSE: NO 

< x i ) SEQUENCE DESCRIPTION: SEQ ID NOSB: 
TCTOCACTAO TTOOAATOOO C AC A TO C AG 

( 2 ) INFORMATION FOR SEQ ID NOS* 

( i ) SEQUENCE CHARACTERISTICS: 
( A )LENOTH:79»WpBW 
( B ) TYPE: onckac jcxS 
( C ) STRANDEDNESS: double 
< D ) TOPOLOGY: Guar 

( i i ) MOLECULE TYPE: DNA (genomic) 

(ill ) HYPOTHETICAL: NO 

( i ▼ ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOt9* 



OOCCGCAAAT 


TCTATTTCAA 


COAGAC AOTC 


AT A AT 0 AAAT 


ACCT ATTOCC 


TACOOCAOCC 


6 0 


OCTOOATTOT 


TATTACTCOC 


TGCCCAACCA 


OCCATOOCCC 


AOOTOAAACT 


OCTCOAOTCA 


I 20 


OOACCTGGCC 


TCOTOAAACC 


TTCTCAOTCT 


CTOTCTCTCA 


CCTOCTCTOT 


CACTO ACT AC 


1 S 0 


T CCATC ACC A 


OTGC T TATTA 


CTOOAACTOO 


ATCCOOC AOT 


T TCCAOOAAA 


CAAACTOOAA 


2 4 0 


TOOATOGGCT 


ACAT A AOCT A 


CO ACOGTGTC 


AAT AAGT ATO 


ATCCATCTCT 


CA AOA ATCGA 


3 0 0 


ATCT CCATC A 


CTCOTOACAC 


ATCT A AC A AT 


CAOTTTTTCC 


AOAAOTTOAT 


TT CTOTOACT 


3 6 0 


TCTOAOOACA 


C AOOAACAT A 


TGACTGTTC A 


AOAOOOACTA 


OOOCCTCTOC 


TATOOACTAC 


4 2 0 


TOOOOTCAAO 


OAATTTCAOT 


C ACCOTCTCC 


TCAOCC AAA A 


CO AC ACCCCC 


ATCTOTCTAT 


4 8 0 
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6 0 0 

6 6 0 

7 2 0 
7 8 0 
7 9 8 



CCACTCGCCC CTOOATCTGC TOCCCAAACT A ACTCC ATOG TGACCCTGOO ATGCCTOQTC 540 

AAOGOCT ATT TCCCTGAOCC AG TGACAGTG ACCTGGAACT CTGGATCCCT OTCCAOCGGT 

OTOCACACCT TCCCAGCTGT CCTGCAGTCT GACCTCTACA CTCTOAGCAG CTCAGTGACT 

GTCCCCTCCA OCCCTCGOCC CAGCOAGACC OTCACCTGCA ACGTTOCCCA CCCOOCCAOC 

AOCACCAAGO TOOACAAGAA AATTGTOCCC AOGOATTGTA CTAOTTACCC GTACGACGTT 

CCOOACTACO GTTCTTAA 

( 2 ) INFORMATION FOR SEQ ID NO: 100: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 194 base fast 
( 8 ) TYPE: curiae tad 
( C > STRANDEDNESS: Ax*le 
{ D > TOPOLOGY: hour 

( i i ) MOLECULE TYPE: DNA (flcnomic) 

( j i i ) HYPOTHETICAL: NO 

( i ▼ ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.100: 
TGAATTCTAA ACTAGTCGCC AAGGAOACAO TCATAATOAA ATACCTATTG CCTACOOCAO 
CCOCTOOATT OTTACTCOCT OCCCAACCAO CCATOOCCOA GCTCC AGATO ACCCAOTCTC 
CAOCCTCCCT ATCTOCATCT OTOOOAOAAA CTOTCACCAT CACATOTCOA TCAAOTOAOA 
AT ATTACAAT TACT 



6 0 

1 2 0 

1 8 0 

1 9 4 



( 2 > INFORMATION FOR SEQ ID NO: 101: 

( i ) SEQUENCE CHARACTERISTICS: 

< A ) LENGTH: 333 b*»e pair* 
( B ) TYPE: ouckac acid 
( C ) STRANDEDNESS: double 

< D ) TOPOLOGY: Hxcm 

( I i ) MOLECULE TYPE: DNA (genomic) 
( i i i ) HYPOTHETICAL: NO 
( i » ) ANTI-SENSE: NO 

< x i) SEQUENCE DESCRIPTION: SEQ U> NOtlOl: 
CTGATOCTOC ACC A AC TOT A TCCATCTTCC CACCATCCAO TOAOCAOTTA ACATCTOOAG 
OTOCCTCAGT COTGTOCTTC TTOAACAACT TCTACCCCAA AOACTACAAT OTCAAOOOOA 
AG ATTOATOO CAGTOAACGA CAAAATOOCO TCCTOAACAO TTGGACTGAT CAOOACAOCA 1»0 
AAOACAOCAC CTACAOCATO AOCAOCACCC TCACOTTOAC CAAOGACOAO TATOAACOAC 
ATAACAOCTA TACCTOTOAT OCCACTCACA AGACATCAAC TTCACCCATT OTCAAOAOCT 
TCAACAOOAA TOAOTOTTAA ttctaoacoo cgc 

( 2 ) INFORMATION FOR SEQ ID NO: 102: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 150 b*»e p*k* 
( B ) TYPE: aackic tad 
( C ) STRANDEDNESS: double 
{ O ) TOPOLOGY: &dejc 

( i i ) MOLECULE TYPE: DNA (g**onnc) 

( t i i ) HYPOTHETICAL: NO 



6 0 
1 2 0 



2 4 0 

3 0 0 
3 3 3 



121 



5.759,817 

-continued 



122 



( L t ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
GCTOAOOGTG ACGATCCCGC AAAAGCOGCC 
t AT ATCOGTT ATGCGTGOOC OATGOTTGTT 
CTOTTTAAGA AATTCACCTC OAA AOCAAGC 



( 2 ) INFORMATION FOR SEQ ID NCM03: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 36 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: >i&g)e 
< D ) TOPOLOGY: Eonr 

( i i ) MOLECULE TYPE: DNA (gcoonnc) 

(tii ) HYPOTHETICAL: NO 

( j t ) ANTI-SENSE: NO 

( i i > SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
OTOCCCAOOO ATTOTACTAO TOCTOAOOOT GACOAT 



( 2 ) INFORMATION FOR SEQ ID NO: 104: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 b«s* pairs 
f B ) TYPE: nucleic tad 
( C ) STRANDEDNESS: nagle 
( D )TOPOLOGY: fiacar 

( i i ) MOLECULE TYPE: DNA (genomic) 

(lit ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.104: 
ACTCOAATTC TATCAOCTTO CTTTCOAOOT GAA 



( 2 ) INFORMATION FOR SEQ ID NO: 103: 

( i } SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base port 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: tm&c 
( D ) TOPOLOGY: fine* 



( i i ) MOLECULE TYPE: DNA (gem 


aac) 


( i i i ) HYPOTHETICAL: NO 




(tr) ANTI-SENSE: NO 




( x i ) SEQUENCE DESCRIPTION: SI 


2Q ID NO: 105: 


AGGTCCAOCT TCTCOAGTCT 


GO 


( 2 ) INFORMATION FOR SEQ ID NO: 106: 





I 2 



( i } SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 hate pairs 
( B ) TYPE: oucJeic add 
( C ) STRANDEDNESS: tingle 
• ( D ) TOPOLOO Y: fiaear 



( i i ) MOLECULE TYPE: DNA (genomic) 



TTTAACTCCC TOCAAGCCTC AOCGACCOAA 6 0 

OTCATTOTCO GCOCAACTAT CGGTATCAAG 120 



( i i i ) HYPOTHETICAL: NO 
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( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NCfclOC: 
OTCACCCTCA GCACTAOTAC AATCCCTOOO CAC 

( 2 )I^^PORMA^ONFORSEQID NO:l07: 

< i j SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 48 base pan 
{ B ) TYPE: nuckic acid 
{ C ) STRANDEDNESS: amftfe 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (gcnoisic) 

(iii ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i > SBQUENCE DESCRIPTION: SEQ ID NO: 107: 
OAOACOACTA GTGOTGGCOO TGOCTCTCCA TTCGTTTOTG AATATCAA 

( 2 ) INFORMATION FOR SEQ H> NO: 108: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 40 base pair* 
( B ) TYPE: nucleic scid 
{ C ) STRANDEDNESS: smgfe 
( D )TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
TTACTAOCTA OCATAATAAC OGAATACCCA AAAGAACTGG 

( 2 ) INFORMATION FOR SEQ ID NO: 109: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 36 base jmn 
( B ) TYPE: nocJck waA 
( C ) STRANDEDNESS: imflje 
( D ) TOPOLOOY: nneaf 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

{ i y ) ANTI-SENSE: NO 

( xi) SEQUENCE DESCRIPTION: SEQ ID NO-.109: 
TATOCTAOCT AGTAACACOA CAOOTTTCCC GACTOO 

( 2 ) INFORMATION FOR SEQ ID NO". 110: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 heme pmn 
( B ) TYPE: nockk add 
( C ) STRANDEDNESS: onsjk 
( D)TC*?«JpOY:n»t 

( i i ) MOLECULE TYPE: DNA (gcooonc) 

( i 1 i ) HYPOTHETICAL: NO 

( i ? ) ANTI-SENSE: NO 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 



125 



5.759.817 

126 

-continued 



( 2 ) INFORMATION FOR SEQ ID NO- 113: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: TOSWpairi 
( B } TYPE: nxleic add 
< C ) STRANDEDNESS: double 
( D ) TOPOLOGY: hacm 

( i i ) MOLECULE TYPE: DNA (goodie) 



6 0 
I 2 0 



t $ 6 



™ -> 7 

ACCOAGCTCO AATTCOTAAT CATOOTC 

( 2 ) INFORMATION FOR SEQ ID NO-l 11 : 

f i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 186 base pair* 
( B ) TYPE: nucleic jod 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: nnew 

( t i ) MOLECULE TYPE: DNA (genomic) 
( i i i ) HYPOTHETICAL: NO 
ANTI-SENSE: NO 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO. 11 1 : 
OTOCCCAOOO ATTGTACTAO TGC TOAGOOT OACOATCCCO CAAAAGCOGC C TT TAACTCC 
CTOCAAGCCT CAOCOACCOA ATATATCGOT TATGCOTGGG CGATOGTTOT TOTCATTOTC 
GOCOCAACTA TCGOT AT CA A OCT G T TT A AO AAATTCACCT CGAAAGCAAO CTOATAOAAT ISO 
TCGAGT 

( 2 ) INFORMATION FOR SEQ ID NO: 11 2: 

{ i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 666 boe pus 
( B ) TYPE: iDcicic acid 
( C ) STRANDEDNESS: docble 
( D >TOPOLOOY: toe* 

( i i ) MOLECULE TYPE: DNA (genome) 

(iii ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:H2: 

CCATTCOTT T GTOAATATCA AGOCCAAOOC CAATCOTCTO ACCTOCCTCA ACCTCCTOTC 

AATGCTOOCO OCOGCTCTOO TGOTOOTTCT OOTOGCOOCT CTOAOOOTGG TGOCTCTOAO 

OOTOOCOOTT CTOAOG0T0O COOCTCTOAO OOAOGCGOTT CCGGTGOTOO CTCTOOTTCC 

OOTGATTTTG ATTATOAAAA OATOOCAAAC OCTAATAAGG OOOCTATOAC COAAAATOCC 

OATGAAAACG COCTACAGTC TGACGCTAAA OOC A A ACT TG ATTCTGTCOC TACTOATTAC 

GOTGCTOCTA TCOATOOTTT CATTOOTGAC OTTTCCOOCC TTOCTAATGO TAATOOTOCT 

ACTOOTOATT TTGCTOOCTC T A A T T C C.C A A ATOOCTCAAO TCOGTOACGO TOATAATTCA 

CCTTTAATOA ATAATTTCCO TCAATATTTA CCTTCCCTCC CTCAATCOOT TOAATOTCOC 

CCTTTTOTCT TTAOCOCTGO TAAACCATAT OAATTTTCTA TTOATTOTOA CAAAATAAAC 

TTATTCOOTO TCTTTGCGTT TCTTTTATAT OTTOCCACCT TTATOTATGT ATTTTCTACG 

TTTOCTAACA TACTGCOTAA TAAGOAGTCT TAATCATOCC AOTTCTTTTO OOTATTCCOT 
TATTAT 



6 0 
t 2 0 
I 8 0 
14 0 
3 0 0 

3 6 0 

4 2 0 

4 8 0 

5 4 0 

6 0 0 
6 6 0 
6 6 6 



( i i i ) HYPOTHETICAL: NO 
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( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 11 3: 

OAGACGACTA OTGOTGOCOO TOGCTCTCCA TTCOTTTGTG AATATCAAGG CCAAGOCCAA 

TCGTCTOACC TGCCTCAACC TCCTOTCAAT GCTCOCGGCO OCTCTOOTOG TGOTTCTOOT 

OOCOGCTCTG AGOOTOGTGG CTCTGAGOOT OOCCOTTCTG AGOGTOOCOO CTCTGAOCOA 

GOCGOTTCCO OTOGTOOCTC TGGTTCCGOT GATTTTOATT ATOAAAAGAT OOCAAACGCT 

AATAAGOOOG CTATOAC CO A AAATOCCOAT GAAAACOCGC TACAGTCTOA COCTAAAOGC 

AAACTTOATT CTOTCOCTAC TGATTACGGT GCTGCTATCO ATOOTTTCAT TOGTOACGTT 

TCCGOCCTTG CTAATOOTAA TOOTOCTACT GOTOATTTTO CTGGCTCTAA TTCCCA AATG 

OCTCAAOTCG OTGACOOTOA TAATTCACCT TTAATGAATA ATTTCCGTCA ATATTTACCT 

TCCCTCCCTC AATCGOTTGA ATOTCOCCCT TTTOTCTTTA GCOCTOOTAA ACCATATOAA 

TTTTCTATTO ATTOTGACAA AATAAACTTA TTCCOTGOTO TCTTTOCOTT TCTTTTATAT 

OTTOCCACCT TTATOTATOT A TTTTCTACG TTTGCT AACA TACTOCOTAA TAAGGACTCT 

TAATCATOCC AGTTCTTTTG OOTATTCCOT TATTATOCTA OCTAOTAA 

( 2 ) INFORMATION FOR SEQ IDNOLH: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 201 b*»e pakf 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: Eowr 

( i i ) MOLECULE TYPE: DNA (getmsc) 

( i i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( I i ) SEQUENCE DESCRIPTION: SEQ ID NOUl* 
TATGCTAOCT AGTAACACGA CAGGTTTCCC OACTOGAAAG CGOGCAGTGA GCOCAACGCA 
ATTAATGTGA GTT AOCTCAC TCATTAGOCA CCCCAOOCTT TACACTTTAT OCTTCCOGCT 
COTATOTTOT GTOOAATT OT OAGCOOATAA CAATTTCACA CAOGAAACAO CTATOACCAT 
OATTACOAAT TCGAOCTCGO T 

( 2 ) INFORMATION FOR SEQ ED NO:115: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 330 base pak» 
( B ) TYPE: mckic acid 
( C > STRANDEDNESS: dottok 
( D ) TOPOLOGY: finear 

( i i ) MOLECULE TYPE: DNA (saxmoc) 

(ili ) HYPOTHETICAL: NO 

( i t > ANTI-SENSE: NO 

(si) SEQUENCE DESCRIPTION: SEQ 05 NO:113: 
AOOTCCAOCT TCTCGAGTCT OOACCTOOCC T CO T OA A AC C TTCTCAOTCT CTOTCTCTCA 
CCTOCTCTOT CACTOACTAC TCCATCACCA GTGCTTATTA CTOGAACTOO ATCCOOCAGT 
TTCCAGGAAA CAAACTOOAA TOOATOOGCT ACATAAOCTA COACOGTOTC AATAAGTATO 
ATCCATCTCT CAAOAATCGA ATCTCCATCA CTCOTOACAC ATCTAACAAT CAOTTTTTCC 
AOAAOTTOAT TTCTOTGACT TCTGAOGACA CAOGAACATA TO AC TOTTCA AOAOGOACTA 



6 0 
I 2 0 

1 8 0 

2 40 

3 0 0 

3 6 0 

4 2 0 

4 8 0 

5 4 0 

6 0 0 
6 6 0 
70 8 



6 0 

1 2 0 

1 8 0 

2 0 I 



6 0 
] 2 0 

1 8 0 

2 4 0 

3 0 0 
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3 6 0 

4 2 0 

4 S 0 

5 4 0 

6 0 0 

6 6 0 

7 2 0 

7 $ 0 

8 3 0 



OGOCCTCTGC TATOGACTAC TOGGGTCAAO GAATTTCAOT CACCGTCTCC TCAOCCAAAA 

CGACACCCCC ATCTOTCTAT CCACTOOCCC CTOOATCTOC TOCCCAAACT AACTCCATGC 

TGACCCTGGO ATOCCTGGTC AAGOGCT ATT TCCCTOAGCC AGTGACAOTG ACCTGGAACT 

CTGOATCCCT GTCCAOCGOT GTOCACACCT TCCCAOCTOT CCTOCAOTCT OACCTCTACA 

CTCTGAOCAG CTCAOTOACT OTCCCCTCCA GCCCTCOGCC CAOCOAOACC OTCACCTOCA 

ACOTT0CCCA cCCOOCCAOC AOCACCAAGO TOOACAAOAA A ATTGTOCCC AOGOATTG T A 

CTAGTOCTO A GOGTOACOAT CCCOCAAAAO CGGCCTTTAA CTCCCTGCAA OCCTCAGCOA 

CCOAATATAT COOTTATOCO TOGOCOATGG TTOTTOTCAT TOTCOOCOCA ACTATCGOTA 

TCAAOCTOTT TAAOAAATTC ACCTCGAAAO CAAOCTGATA GAATTCGAGT 

( 2 ) INFORMATION FOR SBQ ID NOtI 16: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: .260 base pair* 
( B ) TYPE: tuddc add 
( C ) STRANDEDNESS: doobk) 
( D ) TOPOLOOY: linear 

( t i ) MOLECULE TYPE: DNA (gcommc) 
< i i i ) HYPOTHETICAL: NO 
• (iv) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO 11 6: 
ATGAAATACC TATTGCCTAC OGCAGCCGCT GOATTOTTAT TACTCGCTOC CCAACCAOCC 
ATOOCCCAGO TOAAACTGCT COAGATTTCT AOACTAOTOC TOAOGOTOAC GATCCCOCAA 
AAOCOGCCTT TAACTCCCTO CAAGCCTCAO CGACCOAATA TATCOOTTAT GCOTOOOCGA 
TOGTTOTTOT CATTOTCGOC GCAACTATCG OTATCAAGCT OTTTAAGAAA TTCACCTCGA 
AAOCAAOCTG ATAGAATTCO 

( 1 ) INFORMATION FOR SBQ ID HOM7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: <6l bsws pair* 
( B ) TYPE: Buckie cad 
( C ) STRANDEDNESS: <btiMs 
( D ) TOPOLOOY: fiBccr 

( i i ) MOLECULE TYPE: DNA (g«amic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SBQ ID NO: 1 1 7: 

0TACOCGCCC TGTAOCOOCO CATTAAOCOC GGCOGOTOTO OTOOTTACOC OCAGCOTCAC 

COCTACACTT OCCAOCGCCC TAOCOCCCOC TCCTTTTOCT TTCTTCCCTT CCTTTCTCGC 

CACOTTCOCC GGCTTTCCCC GTCAAOCTCT AAATCGGOOG CTCCCTTTAO OOTTCCOATT 

TAGTGCTTTA COGCACCTCG ACCCCAAAAA AC T T O AT TAG GGTGATGOTT CACGTAGTOO 240 

OCCATCOCCC TOATAGACOG TTTTTCOCCC TTTOACOTTO OAGTCCACOT TCTTTAATAG 

TOOACTCTTO TTCCAAACTG GAACAACACT CAACCCTATC TCOGTCTATT CTTTTOATTT 

ATAAOOOATT TTOCCOATTT COOCCTATTO OTTAAAAAAT OAGCTGATTT AACAAAAATT 

TAA'CGCOAAT TTTAACAAAA TATTAACOTT TACAATTTAA A 



6 0 
1 2 0 

1 8 O 

2 4 0 
2 6 0 
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( 2 ) INFORMATION FOR SEQ ID NO:llS: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pair* 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: **mgk 
( D ) TOPOLOGY: linear 

{ i i ) MOLECULE TYPE: DNA (genomic) 

< i i i ) HYPOTHETICAL: NO 

( i ▼ ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 1 18: 
OCXAT AAACC CTCACTAAAO GO 



( 2 ) INFORMATION FOR SEQ ID NO:lt9: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 21 ba»c jwr» 
( B ) TYPE: oockic acid 
( C ) STRANDEDNESS: single 
< D ) TOPOLOGY: tincar 

( L i ) MOLECULE TYPE: DNA {genomic) 

( i i i ) HYPOTHETICAL: NO 

{ i ▼ ) ANTI-SENSE: NO 

( x i ) SEQUENCE r^ESCWTlON: SEQ ED NO:ll* 
TCTCGCACAA TAATACACOO C 



( 2 ) INFORMATION FOR SEQ ID NOU20: 

( k ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 84 base pair* 
( B ) TYPE: nucleic acid 
( C) STRANDEDNESS: •mgfe 
< D )TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i j ) HYPOTHETICAL: NO 

{ t t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:120: 
OTOTATTATT OTOCGAOANN SNNSNNSNNS NNSNNSNNSN NSNNSNNSNN SNNSNNSNNS 
NNSNNSTOGG OCCAAGOOAC CACO 



( 2 ) INFORMATION FOR SEQ ID NO: 121: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pair* 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: ta&c 
( D ) TOPOLOGY: fineat 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

(i't) SEQUENCE DESCRIPTION: SEQ ID NO-.I21: 
OCATOTACTA GTTTTOTCAC AAGATTTGOO 



( 2 ) INFORMATION FOR SEQ ID NCrl22: 
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( j ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 18 fame pain 
( B ) TYPE: nxldc sad 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: hocsr 

( i » ) MOLECULE TYPE: DNA (gewjanc) 
( i i i ) HYPOTHETICAL: NO 

< i ▼ ) ANTI-SENSE: NO 
< SEQUENCE DESCRIPTION: SEQ CD NO:I22: 

TOOGGC CA AO GGACCACO 



( 2 ) INFORMATION FOR SEQ ID NO-.I23: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 1» base pain 
( B ) TYPE: oockic acid 
( C ) STRANDEDNESS: single 
{ D ) TOPOLOGY: fine* 

( j i ) MOLECULE TYPE: DNA (genomic) 

< i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

(, i ) SEQUENCE DESCRIPTION: SEQ ID NChI23: 
GTOTATTATT GTOCOAGA 



( 2 ) INFORMATION FOR SEQ ID NO: 3 24: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 39 base pun 
( B ) TYPE: nucleic add 
{ C ) STRANDEDNESS: angle 
( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (goonric) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( i x ) FEATURE: 

( A ) NAME/KEY: rtf>eM_jepon 
( B ) LOCATION: I9..21 

( D ) OTHER INFORMATION: /ipt_JTP c3 ~ r * I * fcin ~ 

/ MXC3 M NNS cna be repeated fern 3 to about 7A 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:12* 
GTOTATTATT GTGCGAOANN STOOGOCCAA OOCACCACO 



( 2 ) INFORMATION FOR SEQ ID NO. 125: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 39 base prrin 
( B ) TYPE: ouckk acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Saear 

( i i ) MOLECULE TYPE: DNA (geaocnic) 

(tit ) HYPOTHETICAL: NO 

( /» v ) ANTI-SENSE: NO 

( i x ) FEATURE: 

( A ) NAME/KEY: repej*_jegic* 
{ B ) LOCATION: I9..21 

( D ) OTHER INFORMATION: /rpt_nrpc-~tn«fair 
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/ soie=^NKK can be rcpu&xi ban 3 to about 24 

( x i ) SEQUENCE DESCRIPTION: SEQ H> NO: 125: 
OTGTATTATT GTGCGAOANN KTOGCGCCAA GGGACCACG 



What is claimed is: 10 2. The method of claim 1 wherein said 3 f terminus has the 

1 A method for producing a library of complementarity nucleotide sequence 5'-TGGGGCCAAGGGACCACG-3' 

determining region (CDR)-mutagenized phage-displayed (SEQ ID NO 122). or an oligonucleotide having a sequence 

immunoglobulin heterodimers. the method comprising the complementary thereto. 

steps 0 f : 3. The method of claim 1 wherein said 5' terminus has the 

1) amplifying a CDR portion of a template immunoglo- 15 nucleotide sequence S^GTGTATTATTGTGCGAGAO' 
bulin variable domain gene selected from the group (SEQ ED NO 123) or an oligonucleotide having a sequence 
consisting of a template immunoglobulin heavy chain complementary thereto. 

variable domain gene and a template immunoglobulin 4 The method of claim 1 wherein said template immu- 
light chain variable domain gene, wherein said tern- aoglobulin heavy and light chain genes are obtained from a 
plate immunoglobulin heavy and light chain genes ^ human. 

have a framework region and said CDR portion and 5 The method of claim 1 wherein said CDR portion is 
encode respective heavy and light chain variable CDR3. 

domain polypeptides, and wherein said amplifying is 6 The method of claim 1 wherein said formula is: 

by polymerase chain reaction (PCR) using a PCR 

primer oligonucleotide for mutagenizing a preselected 25 y-GTxrrATTATTGTcxrG 
nucleotide region in said CDR portion, thereby fc*rning CACG-y (SEQ id no 124). 

a library of amplified CDR-mutagenized immunoglo- ? ^ of daim j wherein n is 16 in the formula 

bulin gene fragments, said PCR primer oligonucleotide (SEQ ID NO 120). 

having T and 5' termini and comprising: g ^ mcthod of claira j wherein said formula is: 

a) a nucleotide sequence at the 3' terminus capable of ^ 

hybridizing to a first framework region of said S'^ttgtattattgtgcgagajwoc^ 
selected template immunoglobulin variable domain CACG-r (SEQ n> NO 125). 

gene; 9. a method for producing a complementarity determin- 

b) a nucleotide sequence at the 5* terminus capable of region (CDR)-mutagenized phage-displayed immuno- 
hybridizing to a second framework region of said 35 lobulin heterodimer with altered antigen binding 
selected template immunoglobulin variable domain specificity . the method comprising the steps of: 

gene; and ^ amplifying a CDR portion of a template immunoglo- 

c) a nucleotide sequence between the 3 and 5 termini bu] ^ variablc doma i n gcDC selected from the group 
according to the formula selected from the group consisting of a template immunoglobulin heavy chain 
consisting of: 40 variable domain gene and a template immunoglobulin 

light chain variable domain gene, wherein said tcm- 
[NNSUandlNNKU . plate immunoglobulin heavy and light chain genes 

wherein N is independently any nucleotide. S is G or C and hayc a {^^0^ region and said CDR portion and 

K is G or T. and n is 3 to about 24. the 3* and 5* terminal encode respective heavy and light chain variable 

nucleotide sequences having a length of about 6 to 50 45 domain polypeptides having a preselected antigen 
nucleotides, or an oligonucleotide having a sequence binding specificity to a first antigen, and wherein said 

complementary thereto; amplifying is by polymerase chain reaction (PCR) 

2) inserting individual members of the library of amplified usmg a pqr primer oligonucleotide for mutagenizing a 
CDR-mutagenized immunoglobulin gene fragments preselected nucleotide region in said CDR portion to 
formed in step (1) into a dicistronic phagemid expres- 50 aj tcr sa jd preselected antigen binding specificity, 
sion vector comprising immunoglobulin heavy and thereby forming a library of amplified CDR- 
light chain variable domain genes that lack the immu- mutagenized immunoglobulin gene fragments, said 
noglobulin gene portion corresponding to the fragment PCR primer oligonucleotide having 3* and 5 f termini 
to be inserted, wherein upon insertion said vector is and comprising: 

capable of expressing heavy and light chain variable 55 a ) a nucleotide sequence at the 3' terminus capable of 
domain polypeptides encoded by said vector, thereby hybridizing to a first framework region of said 

forming a library of ditistronic expression vectors selected template immunoglobulin variable domain 

containing amplified CDR-mutagenized immunoglo- gcnc; 

bulin gene fragments; and b) a nucleotide sequence at the 5' terminus capable of 

3) expressing said immunoglobulin heavy and light chain 60 hybridizing to a second framework region of said 
genes in the library of dicistronic expression vectors selected template immunoglobulin variable domain 
formed in step (2) whereby said encoded heavy and gcnc; ^ 

light chain variable domain polypeptides assemble on c ) a nucleotide sequence between the J and 5* termini 

the surface of a phage to form a phage-displayed according to the formula selected from the group 

immunoglobulin heterodimer, thereby producing a 63 consisting of: 

library of CDR-mutagenized phage-displayed immu- 
noglobulin heterodimers. fNNSL and INNKL, 
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wherein N is independently any nucleotide, S is G or C and 
K is G or T. and n is 3 to about 24. the 3' and 5* terminal 
nucleotide sequences having a length of about 6 to 50 
nucleotides, or an oligonucleotide having a sequence 
complementary thereto; 5 

2) inserting individual members of the library of amplified 
CDR-mutagenized immunoglobulin gene fragments 
formed in step (1) into a dicistronic phagemid expres- 
sion vector comprising immunoglobulin heavy and 
light chain variable domain genes that lack the immu- io 
noglobulin gene portion corresponding to the fragment 
to be inserted, wherein upon insertion said vector is 
capable of expressing heavy and light chain variable 
domain polypeptides encoded by said vector, thereby 
forming a library of dicistronic expression vectors 15 
containing amplified CDR-mutagenized immunoglo- 
bulin gene fragments; 

3) expressing said immunoglobulin heavy and light chain 
genes in the library of dicistronic expression vectors 
formed in step (2) whereby said encoded heavy and 
light chain variable domain polypeptides assemble on 
the surface of a phage to form a phage-displayed 
irnmunoglobulin heterodimer. thereby producing a 
library of CDR-mutagenized phage-displayed immu- ^ 
noglobulin heterodimers; and 

4) immunoreacting members of the library of CDR- 
mutagenized phage-displayed immunoglobulin het- 
erodimers formed in step (3) on a preselected second 
antigen, said second antigen being different than said ^ 
first antigen to allow for selection of a CDR- 
mutagenized phage-displayed immunoglobulin het- 
erodimer with altered antigen binding specificity. 

10. The method of claim 9 wherein said 3* terminus has 
the nucleotide sequence 5*-TGGGGCCAAGGGACCACG- 35 
3* (SEQ ID NO 122), or an oligonucleotide having a 
sequence complementary thereto. 

11. The method of claim 9 wherein said 5* terminus has 
the nucleotide sequence S'-GTGTArTATTGTGCGAGAO' 
(SEQ ID NO 123) or an oligonucleotide having a sequence ^ 
complementary thereto. 

12. The method of claim 9 wherein said template immu- 
noglobulin heavy and light chain genes are obtained from a 
human. 

13. The method of claim 9 wherein said CDR portion is 45 
CDR3. 

14. The method of claim 9 wherein said formula is 

5'<TIXnAnAnrjTGCG AGAfNNS kTOGGGCC AAGGGAC- 
CACG-3' (SEQ ID NO 124). 

50 

15. The method of claim 9 wherein n is 16in the formula 
[NNS]„ (SEQ ID NO 120). 

16. The method of claim 9 wherein said formula is: 

5' -GTGTATTATTGTGC G AG A(NNK J^TCGGGCC AAGGG AC- 
CACG-3' (SEQ ID NO 125). 

17. A method for producing a soluble complementarity 
determining region (CDR>mutagenized immunoglobulin 
heterodimer with altered immunoreactiviry to a preselected 
antigen, the method comprising the steps of: eo 

1) amplifying a CDR portion of a template irnmunoglo- 
bulin variable domain gene selected from the group 
consisting of a template immunoglobulin heavy chain 
variable domain gene and a template immunoglobulin 
light chain variable domain gene, wherein said tern- 65 
plate mmiunoglobulin heavy and ligjit chain genes 
have a framework region and said CDR portion and 
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encode respective heavy and light chain variable 
domain polypeptides immunoreactive with a prese- 
lected antigen, and wherein said amplifying is by 
polymerase chain reaction (PGR) using a PCR primer 
oligonucleotide for mutagenizing a preselected nucle- 
otide region in said CDR portion to alter said immu- 
noreactiviry of said immu noglobulin heterodimer to the 
preselected antigen, thereby forming a library of ampli- 
fied CDR-mutagenized immunoglobulin gene 
fragments, said PCR primer oligonucleotide having 3* 
and 5' termini and comprising: 

a) a nucleotide sequence at the 3* terminus capable of 
hybridizing to a first framework region of said 
selected template immunoglobulin variable domain 
gene; 

b) a nucleotide sequence at the 5' terminus capable of 
hybridizing to a second framework region of said 
selected template immunoglobulin variable domain 
gene; and 

c) a nucleotide sequence between the 3* and 5* termini 
according to the formula selected from the group 
consisting of; 

(NNSL and fNNKU, 
wherein N is independently any nucleotide. S is G or t and 
K is G or T. and n is 3 to about 24. the 3' and 5' terminal 
nucleotide sequences having a length of about 6 to 50 
nucleotides, or an oligonucleotide having a sequence 
complementary thereto; 

2) inserting individual members of the library of amplified 
CDR-mutagenized immunoglobulin gene fragments 
formed in step (1) into a dicistronic phagemid expres- 
sion vector comprising immunoglobulin heavy and 
light chain variable domain genes that lack the immu- 
noglobulin gene portion corresponding to the fragment 
to be inserted, wherein upon insertion said vector is 
capable of expressing heavy and light chain variable 
domain polypeptides encoded by said vector, thereby 
forming a library of dicistronic expression vectors 
containing amplified CDR-mutagenized immunoglo- 
bulin gene fragments; 

3) expressing said immunoglobulin heavy and light chain 
genes in the library of dicistronic expression vectors 
formed in step (2) whereby said encoded heavy and 
light chain variable domain polypeptides assemble on 
the surface of a phage to form a phage-displayed 
immunoglobulin heterodimer. thereby producing a 
library of CDR-mutagenized phage-displayed immu- 
noglobulin heterodimers; 

4) immunoreacting members of the library of CDR- 
mutagenized phage-displayed immunoglobulin het- 
erodimers produced in step (5) on said preselected 
antigen to isolate an immunoglobulin heterodimer hav- 
ing altered immunoreactivity; 

5) isolating said immunoreacted CDR-mutagenized 
phage-displayed immunoglobulin heterodimer 
obtained in step (4); 

6) producing a soluble farm of said immunoreacted form 
of immunoreacted CDR-mutagenized phage-displayed 
immunoglobulin heterodimer isolated in step (5); and 

7) assaying said soluble form of immunoreacted CDR- 
mutagenized immunoglobulin heterodimer prepared in 
step (6) to identify a CDR-mutagenized immunoglo- 
bulin heterodimer with altered immunoreactivity to the 
preselected antigen. 

18. The method of claim 17 wherein said assaying is 
determined by an increase in affinity to the preselected 
antigen. 
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19. The method of claim 18 wherein the affinity is greater 
than lO^M" 1 dissociation constant (K^- 

20 The method of claim 17 wherein said 3' terminus has 
the nucleotide sequence y-TGGGGCCAAGGGACCACG- 
3' (SEQ ID NO 122). or an oligonucleotide having a 
sequence complementary thereto. 

21 The method of claim 17 wherein said 5* terminus has 
the nucleotide sequence 5*-CTGTXri7VrTGTGCGAGA-3* 
(SEQ ID NO 123) or an oligonucleotide having a sequence 
complementary thereto. 

22. The method of claim 17 wherein said template immu- 
noglobulin heavy and light chain genes are obtained from a 
human. 
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23. The method of claim 17 wherein said CDR portion is 
CDR3. 

24. The method of claim 17 wherein said formula is 

; S'^nGTyOTTXnXnXjCGAGAINNSinTGGGGCCAAGGGAC- 
CACG-3* (SEQ ED NO 124> 

25. The method of claim 17 wherein n is 16in the formula 
[NNS]„ (SEQ ID NO 120). 

26. The method of claim 17 wherein said formula is: 

° S'-GTTCJrATIXITCrTXX^GAG 

CACG-3' (SEQ ID NO 125)l 

***** 
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T cell receptors 0"CRs) exhibit genetic and structural diversity 
similar to antibodies, but they have binding affinities that are 
several orders of magnitude lower. It has been suggested that TCRs 
undergo selection in vivo to maintain lower affinities. Here, we 
show that there is not an inherent genetic or structural limitation 
on higher affinity. Higher-affinity TCR variants were generated in 
the absence of in vivo selective pressures by using yeast display 
and selection from a library of Va CDR3 mutants. Selected mutants 
had greater than 100-fold higher affinity (K D ** 9 nM) for the 
peptide/MHC ligand while retaining a high degree of peptide 
specificity. Among the high-affinity TCR mutants, a strong prefer- 
ence was found for CDR3a that contained Pro or Gly residues. 
Finally, unlike the wild-type TCR, a soluble monomeric form of a 
high-affinity TCR was capable of directly detecting peptide/MHC 
complexes on antigen-presenting cells. These findings prove that 
affinity maturation of TCRs is possible and suggest a strategy for 
engineering TCRs that can be used in targeting specific peptide/ 
MHC complexes for diagnostic and therapeutic purposes. 

T cells recognize a foreign peptide bound to the MHC product 
through the a/3 heterodimeric receptor. The T cell receptor 
TCR) repertoire has extensive diversity created by the same 
y ene rearrangement mechanisms used in antibody heavy- and 
light-chain genes (1). Most of the diversity is generated at the 
junctions of V and J (or diversity, D) regions that encode the 
complementarity-determining region three (CDR3) of the a and 
P chains (2). However, TCRs do not undergo somatic point 
mutations as do antibodies, and perhaps not coincident ally, 
TCRs also do not undergo the same extent of affinity maturation 
as antibodies. TCRs appear to have affinities that range from 10 5 
to 10 7 M" 1 whereas antibodies have affinities that range from 10 5 
to 10 10 M" 1 (3, 4). 

Whereas the absence of somatic mutation in TCRs may be 
associated with lower affinities, it has also been argued that there 
is not a selective advantage for a TCR to have higher affinity 
(5-7). In fact, the serial-triggering (6) and kinetic proofreading 
(7) models of T cell activation both suggest that very slow 
off-rates (associated with higher affinity) would be detrimental 
to the signaling process. On the other hand, the fastest off-rates 
that have been measurable have been associated with altered 
pMHC that exhibits antagonist activity (8-11). Whereas the 
narrow range of natural TCR affinities has provided some 
evidence for the relationships between off-rates and agonist/ 
antagonist activity, there are also examples that appear to be 
inconsistent with these hypotheses (12, 13). 

There are other possible explanations for why the T cell system 
maintains relatively low TCR:pMHC affinities in vivo. Peptides 
bound within the MHC groove display limited accessible surface 
(14), which may in turn limit the amount of free energy that can 
be generated in the interaction. On the other hand, raising the 
affinity of a TCR by directing the free energy toward the MHC 
lelices: would presumably lead to thymic deletion during nega- 
tive selection (15). Even if such higher-affinity TCR could escape 
thymic deletion, they would likely not maintain the peptide 
specificity required for T cell responses. 



It has not been possible to directly test thess possibilities 
because the generation of TCRs with affinities above 10 7 M" 1 
has not been accomplished. In addition to allowing a kinetic 
basis of T cell triggering, high-affinity TCRs could be used to 
more easily explore the role of peptide in pMHC recognition, 
and as quantitative probes for the expression of pMHC on 
various target cells. Because in vivo selection schemes have not 
yielded TCRs with the intrinsic binding affinities of affinity- 
matured antibodies, in this report, we have used an in vitro 
method for the directed evolution of high-affinity TCRs. The 
method relies on the expression of a library of mutant single- 
chain (VjS-linker-Va) TCRs. on the surface of yeast, as a fusion 
to the surface protein Aga-2 (16, 17). Our previousTstudies have 
shown that the yeast display system could be used to engineer 
variants of the 2C single-chain TCR (scTCR) that were more 
thermally stable and secreted at higher levels (17, 18). The 
stability mutants were isolated by subjecting the entire TCR gene 
to random mutagenesis and selecting for increased surface levels 
with anti-TCR antibodies (17). The mutations that increased 
stability resided at the Va: Vj3 interface or on the outside surface 
of Vj3 in a region not involved in pMHC binding. To isolate TCR 
with higher affinity for pMHC, in the present study, we mutated 
only the CDR3a loop, which is at the center of the pMHC- 
binding site (19). Our efforts were guided by previous findings 
that this region contributed minimal binding free energy to the 
interaction of the 2C TCR with the pMHC ligand QL9/L d (20), 
suggesting that productive interactions might be improved by 
focusing on this region. Remarkably, selection from a relatively 
small library (10 5 mutants) yielded many different TCRs with up 
to 100-fold increased affinity for QL9/IA The high-affinity 
TCRs retained a high degree of peptide specificity although 
there was some variation in fine specificity among the mutants. 
These findings suggest that the in vitro evolution process de- 
scribed here can be used to isolate TCRs with specificities that 
one defines by selection with appropriate pMHC ligands. 

The high-affinity receptors in this study were derived by 
variation at the VJ junction, the same process that operates very 
effectively in vivo through gene rearrangements in T cells (2). 
The fact that we could readily isolate a diverse set of high-affinity 
TCR in vitro indicates that there is not a genetic or structural 
limitation to high-affinity receptors. This supports the view that 
inherently low affinities of TCRs found in vivo are caused by a 
lack of selection for higher affinity and perhaps a selection for 
lower affinity (5-7). Finally, the high-affinity TCR were used in 
monomeric form to detect pMHC on the surface of target cells, 
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Fig. 1. Flow cytometric analysis of yeast cells that express wild-type and 
mutant 2C TCR on their surface. Yeast cells displaying wild-type (17) and 
mutant (m6 and m13) scTCR were stained with anti-V08 antibody F23.2 
(120 nM), the specific a lloantigenic peptide-MHC QL9/L d /lg (40 nM), or a null 
peptide MCMV/L d /'9 (40 nM). The peptides used in this study were QL9 
(QLSPFPFDL), MCMV (YPHFMPTNL), and p2Ca (LSPFPFDL). Binding was de- 
tected by FITC-conjugated goat anti-mouse IgG F(ab'h and analyzed by flow 
cytometry. The negative population (e.g., seen with F23.2 staining) has been 
observed for all yeast-displayed proteins and is thought to be caused by cells 
at a stage of growth or induction that are incapable of expressing surface 
fusion protein (16, 17, 27). 

indicating that soluble forms of the TCR selected with the yeast 
display system can serve as probes for tumor-associated pMHC 
or other T cell-specific ligands. 

Materials and Methods 

Library Construction. The 2C single-chain TCR (scTCR) used as 
the scaffold for directed evolution (T7) contained six mutations 
(0G17E, /3G42E, 0L81S, aL43P, aW82R, and aI118N) that 
have been shown to increase the stability of the TCR but still 
allow pMHC binding (E.V.S., KJD.W., and DM.K., unpublished 
results; and ref. 18). Mutagenic PCR of the T7 scTCR VaCDR3 
was performed by using an AGA-2-specific upstream primer and 
a degenerate downstream primer 5'-CTTTTGTGCCGGATC- 
CAAATGTCAG(SNN) 5 GCTCACAGCACAGAAGTACACG- 
GCCGAGTCGCTC-3'. Underlined bases indicate the positions 
of silent mutations introducing unique BamHl and Eagl restric- 
tion sites. The purified PCR product was digested withMiel and 
BamBI and ligated to Mtel-fomHI-digested T7/pCT302 (16- 
18). The ligation mixture was transformed into DH10B electro-: 
competent Escherichia coli (GIBCO/BRL), and transformants 
were pooled into 250-ml LB supplemented with ampicillin at 
100 fig/ml and grown overnight at 37°C. Plasmid DNA was 
transformed into the yeast strain EB Y100 by the method of Gietz 
and Schiestl (21). 

. Cell Sorting. The yeast library (22) was grown in 2% dextrose/ 
0.67% yeast nitrogen base/1% Casamino acids (Difco) at 30°C 
to an ODeoo = 4.0. To induce surface scTCR expression, yeast 
were pelleted by centrifugation, resuspended to an ODeoo = 1.0 




Wild Type 


TCR 


VOCDR3 








2C 


93 SGFASAL 104 






Mutant 


TCR 


VCXCDR3 


Mutant TCR 


VOCDR3 




ml 


SSYONYL 


mlO 


SLPPPLL 




m2 


SRROHAL 


mil 


f SIPTPSL 




m3 


SSKOTAL 


ml 2 


' SNPPPLL 




m4 


SHPOTRL 


ml3 


SDPFPLL 




to5 


SMPOTRL 


ml 4 


SSPPPRL 




mS 


SHQGRYL 


ml 5 


SAPPP1L 




ra7 


SYLOLRL 








m8 


SKHOIHL 








m9 


SLTORYL 







Fig. 2. Structure and sequences of the 2C TCR CDR3ot. {A) X-ray crystal lo- 
graphk structure of the 2C/dEV8/K b complex with CDR3o aa highlighted. Five 
residues of the 2C VaCDR3 that were randomized by.PCR with a degenerate 
primer are shown in red. The adjacent CDR3 residues. Ser-93 and Leu-104 
shown in blue, were retained in the yeast display library because they have 
been shown to be important in pMHC binding (17, 1 8, 20). (B) Alignment of aa 
sequences of mutant scTCRs isolated by yeast display and selection with 
QL9/L d . Display plasmids were isolated from yeast clones after selection and 
sequenced to determine CDR3a sequences. Mutants ml, m2, m3, m4, m10, 
and m 11 were isolated after the third round of sorting. All other mutants were 
isolated after the fourth round of sorting. 

in 2% galactose/0.67% yeast nitrogen base/1% Casamino acids, 
and incubated at 20°C for ~24 h. In general, **10 7 cells per tube 
were incubated on ice for 1 h with 50 /tl of QL9/L d /IgG dimers 
(23) diluted in PBS (pH 7.4) supplemented with 0.5 mg/ml BSA. 
After incubation, cells were washed and labeled for 30 min with 
FITC-conjugated goat anti-mouse IgG F(ab') 2 (Kirkegaard & 
Perry) in PBS (pH 7.4) supplemented with 0.5 mg/ml BSA. 
Yeast were then washed and resuspended in PBS (pH 7.4) 
supplemented with 0.5 mg/ml BSA immediately before sorting. 
Cells exhibiting the highest fluorescence were isolated by using 
a Coulter 753 bench fluorescence-activated cell sorter. After 
isolation, sorted cells were expanded in 2% dextrose/0.67% 
yeast nitrogen base/1% Casamino acids and induced in 2% 
galactose/0.67% yeast nitrogen base/1% Casamino acids for 
subsequent rounds of selection. A total of four sequential sorts 
were performed. The concentrations of QL9/L d /IgG dimers 
used for staining were 50 ^g/vol for sorts one to three and 
05 /ig/ml for the final sort. The percentages of total cells 
isolated from each sort were 5.55, 2.68, 2.56, and 0.58%, 
respectively. Aliquots of sorts three and four were plated on 2% 
dextrose/0.67% yeast nitrogen base/1% Casamino acids to 
isolate individual clones which were analyzed by flow cytometry 
by using a Coulter Epics XL instrument. 

Soluble scTCR Production. The T7 and m6 scTCR genes were 
excised from pCT302 Nhel-Xhol and ligated into Nhel-Xhol 
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digested pRSGALT, a yeast expression plasmid (18). Ligations 
were transformed into DH10B electrocompetent E. colt 
(GIBCO/BRL). Plasmid DNA was isolated from bacterial cul- 
. tures and transformed into the Saccharomyces cerevisiae strain, 
BJ5464 (a ura3-52 trpl leu2M his3A200 pep4::HIS3 prblL1.6R 
canl GAL) (18). Yeast clones were grown in 1 liter of 2% 
dextrose/0.67% yeast nitrogen base/1% Casamino acids/20 
mg/liter Trp for 48 h at 30°C. To induce scTCR secretion, cells 
were pelleted by centrifugation at 4,000 X g, resuspended in 1 
liter of 2% galactose/0.67% yeast nitrogen base/1% Casamino 
acids/20 mg/1 Trp supplemented with 1 mg/ml BSA, and 
incubated for 72 h at 20°C. Culture supernatants were harvested 
by centrifugation at 4,000 X g, concentrated to «=50 ml, and 
dialyzed against PBS (pH 8.0). The six His-tagged scTCRs were 
purified by native nickel affinity chromatography [Ni-NTA 
Superflow, Qiagen (Chatsworth, CA); 5 mM and 20 mM imi- 
dazole (pH 8.0) wash; 250 mM imidazole elution] (18). 

Cell-Binding Assays. The binding of soluble scTCRs to QL9/L d 
was monitored in a competition format as described (20, 24). 
Peptide-loaded T2-L d cells (3 x 10 5 per well) were incubated for 
1 h on ice in the presence of 125 I-Iabeled anti-L d Fabs (30-5-7) 
and various concentrations of scTCRs. Bound and unbound 
[ 125 I] 30-5-7 Fabs were separated by centrifugation through olive 
oil/dibutyl phthalate. Inhibition curves were constructed to 
determine inhibitor concentrations yielding 50% maximal inhi- 
bition. Dissociation constants were calculated by using the 
formula of Cheng and Prusoff (25). To monitor direct binding of 
scTCRs to cell-bound pMHC, peptide-loaded T2-L d cells (5 x 
10 5 per tube) were incubated for 40 min on ice with biotinylated 
soluble scTCRs followed by staining for 30 min with streptavi- 
din-phycoerythrin (PharMingen). Cellular fluorescence was de- 
tected by flow cytometry. 

Results and Discussion 

To examine if it is possible to generate higher-affinity TCR that 
would retain peptide specificity, we subjected a TCR to a process 



of directed in vitro evolution. Phage display (26) has not yet 
proven successful in the engineering of single-chain TCRs 
(scTCRs, V/3-linker-Va) despite the extensive structural simi- 



larity between antibody and TCR V regions. However, we 
recently showed that a scTCR could be displayed on the surface' 
of yeast (17) in a system that has proven successful in antibody 
engineering (16, 27). A temperature-stabilized variant (T7) (18) 
of the scTCR from the cytotoxic T lymphocyte clone 2C was used 
in the present study. Cytotoxic T lymphocyte clone 2C recog- 
nizes the alioantigen L d with a bound octamer peptide called 
p2Ca, derived from the enzyme 2-oxoglutarate dehydrogenase 

(28) . The nonameric variant QL9 is also recognized by cytotoxic 
T lymphocyte 2C, but with 10-fold higher affinity by the 2C TCR* 

(29) . Alanine scanning mutagenesis showed that the CDR3a 
loop contributed minimal free energy to the binding interaction 
(20), even though structural studies have shown that CDR3a of 
the 2C TCR is near the peptide and it undergoes a conforma- 
tional change to accommodate the pMHC complex (19). Thus, 
we focused our mutagenesis efforts on five residues that form the 
CDR3a loop. 

A library of 10 5 independent TCR-CDR3a yeast mutants was 
subjected to selection by flow cytometry with a fluorescently 
labeled QL9/L d ligand (23). After four rounds of sorting and 
growth, 15 different yeast colonies were examined for their 
ability to bind the ligand, in comparison to the scTCR variant T7, 
which bears the wild-type CDR3a sequence (Fig. 1 and data not 
shown). The anti-V08.2 antibody F23.2 which recognizes resi- 
dues in the CDR1 and CDR2 was used as a control to show that 
wild-type scTCR-T7 and scTCR mutants (m6 and ml3, in Fig. 
1, and others, data not shown) each had approximately equiv- 
alent surface levels of the scTCR (Fig. 1). In contrast, the soluble 
QL9/L d ligand bound very well to each mutant yeast clone but 
not to wild-type scTCR-T7. The MCMV/L d complex, which is 
not recognized by cytotoxic T lymphocyte clone 2C, did not bind 




□ QL9-5F 

B QL9-5Y 

H QL9-5H 

0 Q19-5E 

H MCMV 



Clone 

Fig. 3. Fine specificity analysis of mutant scTCR binding to different QL9 variant peptides bound to L d . The original T ceil clone 2C and various yeast clones were 
analyzed by flow cytometry for binding to L d /lg dimers loaded with wild-type QL9 (QL9-5F), position 5 variants of QL9 <QL9-5Y, QL9-SH. and QL9-5E) or MCMV. 
Binding was detected with FITC-labeled goat anti-mouse IgG. Relative fluorescence was measured by comparison with mean fluorescence values of 2C cells or 
yeast cells stained with anti-V08 antibody F23.2. 
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Inhibitor Concentration (uM) 

fig. 4. QL9/L d binding by soluble scTCRs. T2-L d cells loaded with QL9 were 
incubated with ^-labeled anti-l d Fab fragments (30-5-7) and various 
concentrations of unlabeled Fab (□), scTCR-T7 (B), or mutant scTCR-m6 (O) 
Bound and unbound [ 125 l]30-5-7 Fab fragments were separated by cen- 
tnfugation through olive oil/dibutyl phthalate. Binding of 125 l-labeled 
anti-l* Fab fragments to T2-L d cells loaded with the control peptide MCMV 
was not inhibited even at the highest concentrations of scTCRs (data not 
shown). 



to the scTCR mutants or wild-type scTCR-T7, indicating that the 
scTCR mutants retained peptide specificity. 

The CDR3 at sequences of the 15 mutants all differed from the 
2C TCR (Fig. 2). It was readily apparent (and confirmed by a 
BLAST alignment algorithm) that the sequences could be aligned 
mto two motifs. One motif contained Gly in the middle of the 
five residue stretch whereas the other motif contained three 
tandem Pro. Evidence that all three Pro are important in 
generating the highest affinity site is suggested by results with 
mutant mil. Mutant mil contained only two of the three Pro 
and exhibited reduced binding compared with the triple-Pro 
mutants (data not shown). The Gly-containing mutants ap- 
peared to have preferences for positive-charged residues among 
the two residues to the carboxyl side (7/9) and aromatic and/or 
positive-charged residues among the two residues to the amino 
side (4/9 and 5/9). The selection for a glycine residue at position 
102 in the motif may indicate that the CDR3a loop required 
conformational flexibility around this residue to achieve in- 
creased affinity. This is consistent with the large (6-A) confor- 
mational difference observed between the CDR3a loops of the 
liganded and unliganded 2C TCR (19). It is also interesting that 
Gly is the most common residue at the V(D)J junctions of 
antibodies and the presence of a Gly has recently been associated 
with increased affinity in the response to the (4-hydroxy-3- 
nitrophenyl) acetyl hapten (30). 

In contrast to the isolates that contain Gly, the selection for 
a Pro-rich sequence at the tip of the CDR3a loop may suggest 
that these TCR exhibit a more rigid conformation that confers 
higher affinity. The x-ray crystallographic structures of a germ- 
line antibody of low affinity compared with its affinity-matured 
derivative showed that the high-affinity state was associated with 
stabilization of the antibody in a configuration that accommo- 
dated the hapten (31). Similarly, the NMR solution structure of 
a scTCR that may be analogous to the germ-line antibody 
showed that the CDR3a and j3 loops both exhibited significant 
mobility (32). Recent thermodynamic studies of TCRrpMHC 
interactions have also suggested the importance of conforma- 
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Fig. 5. Flow cytometric analysis of the binding of scTCR/biotin to cell 
surface peptide/MHC. Peptide-loaded T2-L<* cells were incubated with 
biotinylated m6 scTCR (-0.3 pM) or T7 scTCR (-1 .6 jiM) scTCR followed by 
streptavidin-PE and analyzed by flow cytometry. (A) Flow cytometry his- 
tograms of T2-L d cells loaded with QL9 (unshaded), p2Ca (light shade), or 
MCMV (dark shade) and stained with m6 scTCR/biotin. (fi) Mean fluores- 
cent units (MFU) of T2-L d cells loaded with QL9, p2Ca, or MCMV and stained 
with either secondary SA-PE only, T7 scTCR/biotin + SA-PE, or m6 scTCR/ 
biotin + SA-PE. 



tional changes in binding (33, 34). Structural and thermodynamic 
studies of the TCR mutants that we report here should allow us 
to examine if the two CDR3a motifs (Gry- versus Pro-rich) might 
differ in the mechanism by which they confer higher affinity. 

Although the scTCR mutants did not bind the null peptide /L d 
complex MCMV/L d , it remained possible that the increase in 
affinity might be accompanied by a change in fine specificity. To 
examine this issue, we used QL9 position 5 (Phe) peptide 
variants that have been shown previously to exhibit significant 
differences in their binding affinity for the wild-type 2C TCR 
(35). The binding of these pMHC to various TCR mutants on the 
yeast surface and clone 2C were measured by flow cytometry. As 
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shown in Fig. 3, the native TCR on 2C is capable of binding QL9 
variants that contain either Tyr or His at position five but not 
Glu. Each of the higher-affinity TCR mutants retained their 
ability to recognize the conserved Tyr-substituted peptide and 
they were likewise incapable of recognizing the Glu-substituted 
peptide. However, several of the TCR mutants (m6, m9, and 
ml3) bound to the His-substituted peptide (albeit to different 
extents) whereas other mutants (m5, m7, and ml4) did not bind 
the peptide. Thus, the CDR3a loop can influence the peptide 
fine specificity of recognition but it is not the only region of the 
TCR involved. The effect on peptide specificity could be through 
direct interaction of CDR3a residues with the variant peptide, as 
suggested from earlier studies involving CDR3-directed selec- 
tions (36, 37). Alternatively, binding free energy may be directed 
at peptide-induced changes in the L d molecule itself. The latter 
possibility is perhaps more likely in the case of the 2C 
TCR:QL9/L d interaction, because position five of QL9 has been 
predicted to point toward the L d groove (35, 38). The fine- 
specificity analysis also shows that it is possible to engineer TCRs 
with increased, or at least altered, specificity for cognate pep- 
tides. Thus, directed evolution of only a short region (CDR3a) 
of a single TCR allows the design of TCR variants with altered 
peptide-binding specificities. 

To determine the magnitude of the affinity increases associated 
with a selected CDR3a mutant, the wild- type T7 scTCR and the m6 
scTCR were expressed as soluble forms in a yeast secretion system. 
Purified scTCR preparations were compared for their ability to 
block the binding of a l25 I-labeled anti-L d Fab fragments to QL9 or 
MCMV loaded onto L d on the surface of T2-L d cells. As expected, 
neither T7 nor m6 scTCR were capable of inhibiting the binding of 
125 I-Fab fragments to T2-L d cells up-regulated with the MCMV 
peptide (data not shown). However, both T7 and m6 were capable 
of inhibiting the binding of anti-L d Fab fragments to QL9/L d (Fig. 
4). The m6 scTCR variant was as effective as unlabeled Fab 
fragments in inhibiting binding, whereas the T7 scTCR was 160-fold 
less effective (average of 140-fold difference among four indepen- 
dent titrations). The Ko values of the scTCR for QL9/L d were 
calculated from the inhibition curves to be 1.5 fjM for T7 and 9.0 
nM for m6. The value for T7 is in close agreement with the 3.2 /xM 
K D reported for the 2C scTCR (39). These findings show that the 
yeast system, combined with CDR3a-directed mutagenesis, is 
capable of selecting mutants with 100-fold higher intrinsic binding 
affinities for a pMHC ligand. 

If the soluble scTCR has a high affinity for its pMHC ligand, 
then it may be useful, like antibodies, as a specific probe for 
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cell surface-bound antigen. To test this possibility, the soluble 
T7 and m6 scTCR were biotinylated and the labeled scTCR 
were incubated with T2-L d cells loaded with QL9, p2Ca, or 
MCMV. The m6 scTCR, but not the T7 scTCR, yielded easily 
detectable staining of the T2 cells that had been incubated with 
QL9 or p2Ca (Fig. 5 A and £). It is significant that p2Ca-up- 
regulated cells were also readily detected by m6 scTCR, 
because p2Ca is the naturally processed form of the peptide 
recognized by the alloreactive clone 2C and it has an even 
lower affinity than the QL9/L d complex for the 2C TCR (29). 
However, it remains to be determined if the levels of pMHC 
derived from endogenous antigen processing are sufficient to 
allow detection by using soluble TCR as probes. It is reason- 
able to predict that, in some cases, the level will be too low to 
distinguish from background by using standard flow cytometry 
procedures. 

The high-affinity receptors described in our study were de- 
rived by variation at the VJ junction, the same process that 
operates very effectively in vivo through gene rearrangements in 
T cells (2). The fact that we could readily isolate a diverse set of 
high-affinity TCR in vitro indicates that there is not a genetic or 
structural limitation to high-affinity receptors. This supports the 
view that inherently low affinities of TCRs found in vivo are 
caused by a lack of selection for higher affinity and perhaps a 
selection for lower affinity (5-7). In this respect, the higher- 
affinity TCRs now provide the reagents for directly testing 
hypotheses about the effects of affinity on T cell responses (4-7). 
It is interesting to note that similar arguments have been used to 
suggest that the kinetic properties of antibodies may also set an 
in vivo "affinity ceiling," above which there may not be a selective 
advantage to B cells (40). 

In addition to their utility for testing T cell responses, 
high-affinity TCRs can be engineered like antibodies to yield 
high-affinity, antigen-specific probes. Soluble versions of 
the high-affinity receptor can directly detect specific peptide/ 
MHC complexes on cells (Fig. 5). Thus, these engineered 
proteins have potential, for example, as tumor cell diagnostics, 
or on conjugation with cytotoxins, potential agents for cancer 
therapy. 
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